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Pyejace 


Solutions to Problems in the field of digital Image Processing generally requlre 
extensijve experimental work involving software Simulation and testing withb jarge Sets 
of sample images. Although algorithm develjopment typically ls based on theoreticaj 
underpinnings, the actual implementation of these algorithms almost always Tequlres 
parameter esttmation and,frequentiy algorithm revlsion and comparnson of candlidate 
solutions. Thus, selection of a flexible, comprehensive, and wejll-documented softtware 
development environment is a key factor that has important Implications in the cost， 
development time, and portabllity of image processing Solutlons，. 

In spite of its Importance, SUrpIisingly little has been written on this aspect of the 
field in the form of textbook material dealling with both theoretical Principles and so 妨 - 
ware Implementation of digital image processing concepts. TIhis book was written for 
just this Purpose. Hts main objective is to prIovide a foundation for implementing image 
processing algorithms Using modern Software tools.A complementary Objective was to 
Prepare a book that is seif-contained and easily readable by individuals with a basic 
background in digital Image processing, mathematical analysis, and computer Pro- 
gramrming, aljatajleveltypical of that found in a juniorsenior curricuium in a technl- 
cal discipline.Rudimentary knowledge of MAILAB also ls desirable. 

To achleve these Objectives we felt that two key ingredients were needed, TIhe 
first was to Select Image Processing material that 1$ representative of materlal cov- 
ered in a formal course of instructlon in thils field. IThe second was to Select soft- 
ware tools that are well Supported and documented, and which have a wide range 
of appPlications ln the“Teal”world. 

Jo meetthe firstobjective,most of the theoretical concepts in the following chapters 
were Sejlected from Dial zaage Proceysing by Gonzajlez and Woods, which has been 
the choice jntroductory textbook used by educators all over the world for over two 
decades.Jhe software tools selected are from the MAILAB [Image Processing TIboibox 
(IPT),which simijlarly occupies a position of eminence in both education and industrial 
applications.A basic strategy followed in the preparation of the book was to provide 3 
seamjless integration of well-established theoretical concepts and their Implementation 
using state-of-the-art Software tools. 

The book is organized along the same lines as Digiia zaage Procesyi118.In this way， 
the reader has easy access to a more detailed treatment of all the image processing 
concepts discussed here, as well as an Up-to-date Set of Teferences for further reading. 
Following this approach made it possible to present theoretical material in a Succinct 
manner and thus we were able to maintain a focus on the sottware Implementatlon as- 
pects of image pIocessing problem solutions. Because lt works in the MAILAB com- 
puting environment,the Image Processing Toolbox offers some Significant advantages， 
not only in the breadth of its computational tools, but also because it ls Supported 
under most operating Systemas in use today A unidque feature of thls book is its empha- 
Sis on Showing how to develop new code to enhance existing MATILAB and IPT func- 
tonality This is an important feature in an area Such as image processing, which, as 
noted earlier, js characterized by the need for extensive algorithm development and 
expemnmental work. 

Atter an introduction to the fundamentals of MATLAB functions and program- 
ming, the book Proceeds to address the mainstream areas of image Processing, The 
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major areas covered include intenslty transformations,linear and nonlinear Spatial 结 - 
tering, filtering in the frequency domain， image restoration and registration, cojlor 
image processing, wavelets, image data compression,morphological Image PrOcessing， 
image Segmentation, region and boundary representation and description, and object 
recosgnition. This mmateriaj 1$ complemented by numerous ilustrations of how to solve 
image processing problems using MAILAB andIPT functions. In cases where a func- 
tion did not exist, a new function was wrTitten and documented as Pat of the instruc- 
tional focus of the book. Over 60 new functions are included in the following chapters， 
These functions increase the scope of IPT by approximately 32 Percent and also Serve 
the important purpose of further ilustrating how to implement new image Processing 
software Solutions. 

The material is presented n textbook format, not as a software inanual. Althousgh 
the book is self-contained, we have established a companion Web site (See Section 1.5) 
designed to provide Support in a number of areas. For students following a formal 
course of study or individuals embarked on a PIOgTam of seif study, the site contalns 
tutorials and reviews on background material, as well as Projects and image databases， 
including all images in the book. For instructors, the site contains clasSToOoIm PIeSenta- 
tion materials that include PowerPoint slides of all the images and graphucs Used in tbe 
book. Individuals already familiar with image Processing and IPT fundamentals will 
find the site a useful Place for Up-to-date references, neW implementation technlques， 
and a host of other support material not easily found elsewhere. All purchasers of the 
book are eligible to download executable files of all the new functions developed in 
the text. 

Asistrue of most writing efforts of this nature,Progress continues after work on the 
manuscript stops. For this reason, we devoted significant eftort to the Selection of ma- 
terial that we believe is fundamental and whose value is jikejly to remain applicable In 
a rapidly evolving body of knowledge. We trust that readers of the book will benetit 
from this effort and thus find the materialtimely and useful in their work， 
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Dilgital image PIOcesSing is an area characterized by the need for extensive ex- 
perlimental work to establish the viability of proposed solutions to a_ givemn 
Prioblem. In thls chapter we outjine how a theoretical base and state-of-the-art 
software can be integrated into a prototyping environment whose objective is 
to provlide a set of well-supported tools for the solution of a broad class of 
problems in digital image Processing. 





Background 


An Important characteristic Underlying the design of image processing SYS- 
tems is the Significant level of testing and experimentation that normally is re- 
qulred before arriving at an acceptable solution. This characteristic impjlies 
that the ability to formujate approaches and quickly prototype candidate solu- 
tons generalljy Plays a major role in reducing the cost and time required to 
arrlve at a Vlabjle System Implementation. 

Little has been written in the way of instructional material to bridge the gap 
between theory and appjlication in a well-supported software environment. The 
main objective of thijs book is to integrate under one cover a broad base of the- 
oretical concepts with the knowiedge required to implement those concepts 
SIng State-oft-the-art Image processlng Software tools. The theoretical underpin- 
nings of the material in the following chapters are mainjy from the leading text- 
book in the field: Digiial Jaage Processing, by Gonzalez and Woods, Published 
by Prentice Hall.The software code and supporting toois are based on the lead- 
Ing software package in the field: The MA47ZL4 1aage Processing 7Dolbpox' 
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from The MathWorks, Inc. (see Section 1.3).The material in the PITesent book 
shares the same design, notation, and style of Presentation as the Gonzalez- 
Woods book, thus simplifying cross-referencing between the two， 

The book is self-contained. To master its contents, the reader Should have 
introductory preparation in digital image Processing, either by having taken a 
formal course of study on the subject at the senior or first-year graduate level， 
of by acquiring the necessary background im a PTOSgTarn of self-study. It is as- 
sumed also that the reader has some familiarity with MAITLAB, as weli as 
rudimentary knowledsge of the basics of compnuter programming， Such as that 
acquired in a sophomore- of juniorlevel course on Programrning in a technl- 
cally oriented language. Because MAILAB is an arTay-oriented 1]anguage， 
basic knowledge of matrix analysis also is helpful 

The book is based on principles. Itis organized and presented in a textbook 
format, not as a manual, Thus, basic ideas of both theory and softtware are eX- 
plained prior to the development of any new Programming concepts. jhe ma- 
terial is jllustrated and clarified further by numerous examples ranging ftrom 
medicine and industrial inspection to remote Sensing and astronomy LIbls ap- 
proach allows orderly progression from Simple concepts to sophisticated 1m- 
plementation of image processing algorithms， 甩 Oowever， readers already 
familiar with MATLAB,IPT, and image processing fundamentals can Proceed 
directly to specific applications of interest, in which case the functions in the 
book can be used as an extension of the family of IPT functions. Alil new func- 
tions developed in the book are fully documented, and the code for each 1S 
included either in a chapter or in Appendix (. 

Over 60 new functions are developed in the chapters that follow. IEhese 
functions complement and extend by 35% the set of about 175 functions in 
IPT. In addition to addressing specific applications,the new functions are clear 
examples of how to combine existing MATILAB and IPT functions with new 
code to develop prototypic solutions to a broad spectrum of problerms ln digl- 
tal image processing.The toolbox functions, as well as the functions developed 
in the book,run under most operating systems. ConSult the book Web slte (see 
Section 1.5) for a complete list. 
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An image may be defined as a two-dimensional function, 帮 x, 7) whereY and 
y are spafial coordinales, and the amplitude of 三 at any Palr of coordinates 
(xy is called the iniensity or 8ray level of the image at that potnt， When xy y， 
and the amplitude values of 矿 are all finite, discrete quantities，we call the 
image a digial image.The field of digital iaage Procesyin8g refers to processing 
digital images by means of a digital computer.Note that a digital Image is COm- 
posed of a finite number of elements, each of which has a Particujar location 
and value. These elements are referred to as Picftre eleaemiy, ;7108e elerte1155， 
pels,and pixre1. Pixel is the term most widely used to denote the elements of a 
dipgital image. We consider these definitions formally in Chapter 2， 
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Vision ls the most advanced of our Senses, So lt ls not SUIPTIising that images 
play the single most Important role In human perception. However, unlike hu- 
mans, Who are jimlted to the viSual band of the electromagnetic (上 EM) spec- 
trum, Imaging machines cover almost the entire EM spectrum, ranging from 
gamma to radio waves. Ihey can operate also on images generated by Sources 
that humans are not accustomed to assSoctating with Images. These include ul- 
trasound, electron microscopy and computer-generated images. Thus, digital 
image PTOCceSsslnDg encompasses 8 wide and varied field of applications. 

There 1s no general agreement among authors regarding where jimage PTro- 
cessing Stops and other related areas, Such as Image analysis and computer Vi- 
sion, Start. Sometimes a distinction is made by defining image Processing as 3 
discipline In which both the input and output of a process are images. We be- 
lieve thls to be a hmiting and Somewhat artificial boundary. For exampie， 
under thls definition, even the trivial task of computing the average intensity 
of an image would not be consldered an image Processing operation. On the 
other hand, there are fields such as computer vislon whose Ultimate goal is to 
Use computers to emujate human vision, Including learning and being abljle to 
make inferences and take actions based on visual inputs. This area itself is a 
branch of artficial intellljgence (AlJ), whose objective is to emulate human in- 
telllgence. JIhe field of Alils in lts earliest stages of infancy in terms of devel- 
opment,， with prTogress having been much siower than originally anticipated. 
The area of Image analysis (also called image understanding) is in between 
mage Processlng and computer vision， 

There are no clear-cut boundaries in the continuum from image processing 
at one end to compnuter vision at the other, However one useful paradigm is to 
conslder three types of compnuterized processes in this continuum: low-, mid-， 
and high-levei Processes. Low-level processes inyolve Primitive Operations 
Such as Image preprocessing to reduce noise, contrast enhancement,and image 
sharpening.A low-level process is characterized by the fact that both its inputs 
and outputs are images. Mid-level processes on images involve tasks such as 
Segmentation (Partitioning an image into Tegions Or objects), description of 
those objects to reduce them to a form suitable for compnuter Processing, and 
classification (Tecognition) of individual objects.A mid-level process is charac- 
terlzed by the fact that its inputs generally are images, but its outputs are at- 
tnibutes extracted from those images (e.g.,edges, contours, and the identity of 
Individual objects). Finally, higherlevel processing involves“making sense”of 
an ensembjle of recognized objects, as in Image analysis, and, at the far end 
of the continuum, performing the cognitive functions normaily associated with 
human vision. 

Based on the preceding comments, we See that a logical Place of overlap be- 
twWeen Image processing and Image analysis is the area of recopgnition of 
individuaj Teglions Or Objects in an image.Thus, what we call in this book digita! 
f008e P1OCesS1118 encOomipasses PIOCesSes whose inputs and outputs are images 
and,imzadditiom,encompasses processes that extract attributes from images,up 
to and including the recognition of individual objects. As a simple illustration 
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to clarify these concepts, consider the area of automated analysis of text, [be 
processes of acquiring an image of the area containing the text, Preprocesslng 
that image, extracting (segmenting) the individual characters, describling the 
characters in a form suitable for compnuter Processing, and ITecognizing those 
individual characters, are in the scope of what we call digital image ProcesSlng 
in this book. Making sense of the content of the page Imay be Viewed as 
being in the domain of image analysis and even compnuter vision, depending on 
the ievel of complexity implied by the statement“making sense.”Digltal 
image processing,as we have defined it,is used Successfully in a broad range of 
areas of exceptional social and economic value. 





遇 E 4 Background on MAILAB and the Image 
Processing Joolbox 


MATLAB is a high-performance language for technical computing. t inte- 
grates computation, visualization, and programrming in an easy-to-use environ- 
ment where Problems and solutions are expressed in familiar mathematical 
notation. Typical uses include the following: 


*。 Math and computation 

*， Algorithm development 

Data acqulsition 

*。 Modejing, simulation, and Prototyping 

*。 Data analysis, exploration, and visuallization 

， Scientific and engineering gTaphilcs 

。 Application development, including graphical user interface building 


MATLAB is an interactive System whose basic data element is an array that 
does not require dimensioning. This allows formulating solutions to many 
technical computing Problems, especialljy those involving matrlx TePresenta- 
tions, in a fraction of the time it would take to write a Program in a Scajar noOn- 
interactive language Such as C OoT ortran， 

The name MATLAB stands for matrix jaboratory,. MAITLAB was written 
originally to provide easy access to Imatrix software deveioped by the LIN- 
PACK (Linear System Package) and EISPACK (Eigen System Package) Pro- 
jects. Today MATLAB engines incorporate the LAPACKR (Linear Algebra 
Package) and BLAS (Basic Linear Algebra Subprograms) libraries, constitut- 
ing the state of the art in software for matrix computation. 

In university environments,MATLAB isthe standard computationajl tool for 
introductory and advanced courses in mathematics, engineerIing, and Science. in 
industry MATLAB is the computational tool of choice for research， develop- 
ment, and anajlysis. MATLAB is complemented by a family of application- 
specific solutions called toolbpoxes.TIhe Image Processing Toolbox is a collection 
of MATLAB functions (callied M-jurctions or M-jies) that extend the capabill- 
ty of the MATLAB environment for the soliution of digital image PIocesSsing 
problems. Other toolboxes that sometimes are used to complement IPT are the 
Signal Processing, Neural Network, Fuzzy Logic, and Wavejet Ioolboxes. 








1.4 加 Areas of Image Processing Covered in theBook 5 


The MATLAB Student Yersion includes a full-featured verslion of 
MATLAB. The Student Version can be Purchased at Signifticant dlscounts at 
university booKkstores and at the MathWorks' Web Site (www.Imathworks.comy). 
Student versions of add-on products, including the Image Processing IJoolbox， 
also are available， 





中 Areas of Image Processing Covered in the Book 


Every chapter in this book contains the pertinent MATLAB and IPTL materlal 
needed to jimplement the image Processing methods discussed, When a MAT- 
LAB or IPT function does not exist to impjement a Specific method, a new 
function is developed and documented. As noted earlier, a compPlete lsting of 
every new function is included in the book. The remaining eleven chapters 
cover materlal in the following are3s. 


Chapter 2: Fundamentajs，This chapter covers the fundamentals of MAILAB 
notation,jindexing, and programming concepts. This materlal SerVes as founda- 
tion for the rest of the book. 


Chapter 3: Intensity Transformations and Spatial Filtering，Ibljs chapter cov- 
ers in detail how to use MATLAB and IPT to implement intensity transfor- 
mation functions，Linear and nonlinear spatial filters are covered and 
ilustrated in detall. 


Chapter 4: Processing in the Frequency Domain。The material in this chapter 
Shows how to use IPT functions for computing the forward and inverse fast 
Fourier transforms (FFTs),how to visualize the Fourier Spectrum and how to 
implement filtering in the frequency domain. Shown ajso ls a method for gen- 
erating frequency domain filters from specified spatial 和 lters. 


Chapter $: Image Restoration， Traditional linear Testoration methods,such as 
the Wiener filter, are covered in this chapter. Iterative, nonjinear methods， 
such as the Richardson-Lucy method and maximum-likeljihood estimation tor 
blind deconvolution, are discussed and ilustrated. Geometric corrections and 
image reglstration also are covered， 


Chapter 6: Color Image Processing，This chapter deals with pseudocolor and 
full-color image processing. Color models applicabjle to digital Image PIOCeSS- 
ing are discussed,and IPT functionaljlity in color Processing 1$ extended vla lm- 
plementation of additional color models. The chapter also covers applications 
of color to edge detection and region Segmentation. 


Chapter 7: Wavelets。In its current form, IPT does not have any wavejlet trans- 
forms.A set of wavelet-related functions compatible with the Wavelet Ioolbox 
is developed in this chapter that will allow the reader to implement al the 
wavelet-transform concepts discussed in the Gonzalez-Woods book， 


Chapter 8: Image Compression。 The toolbox does nothave any data comPIeS- 
Sion functions. In this chapter, we develop a set of functions that can be used 
tor this purpose， 
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Chapter 9: Morphological Image Processing. The broad spectrum of ftunc- 
tions available in IPT for morphological image Processing are explained and 
illustrated in this chapter using both binary and gray-scale 1mages. 


Chapter 10: Image Segmentation. The set of IPT functions available for 
image segmentation are explained and illustrated in this chapter. New ftunc- 
tions for Hough transform processing and reglon growing also are developed. 


Chapter 11l: Representation and Description。 several new functions for ob- 
ject repiresentation and description,incltuding chain-code and polygonal repre- 
sentations, are developed in this chapter, New functions are included also for 
object description, including Fourier descriptors, texture, and moment inVarl- 
ants. These functions compjlement an extensive set of reglon property func- 
tions available in IPT 


Chapter 12: Object Recognition。 One of the important features of thls chap- 
ter js the efficient implementation of functions for computing the Eucjidean 
and Mahalanobis distances. These functions play a central role ln pattern 
matching. The chapter also contalins a comprehenslve discussion On how to 
manipulate strings of symbols in MATLAB.String manipulation and matching 
are important in Structural Pattern recognition， 


In addition to the Preceding material, the book contains three apPendices. 


Appendix A:; Contalins a Summary of all IPT and new image-processing func- 
tions developed in the book. Relevant MATILAB function also are Included. 
This is a useful reference that provides a global overview of all functions in the 
toolbox and the book. 


Appendix B: Contains a discussion on how to Implement graphical USer interT- 
faces (GUJs) in MATLAB.GUfs are a useful complement to the materlal in 
the book because they simplify and make more intultive the control of interT- 
active functionas，. 


Appendix C: New tunction listings are Included in the body of a chapter when 
a new concept js explained. Otherwise the listitng ls included in Appendix (CC， 
This ls true also for listings of functions that are lengthy. Deferring the listing 
of some functions to thls appendix was done primarily to avold breaking the 
fiow of explanations in text mateTial. 


昌 则 The Book Web Site 


An Important feature of this book is the Support contained in the book Web 
Site. JIhe site address 1S 





www.prenhall.comy/gonzalezwoodseddins 
This site Provides Support to the book in the folljowing areas: 


。 Downloadable M-files, Including all M-files in the book 
se。 Tutorials 





1.7 续 The MAILAB Working Environment 


e。 Proiects 

e。 Teaching materlals 

e。 Tingks to databases, including all images in the book 
e。 了 Book updates 

e。 Backpground publications 


The site ls integrated with the Web slte of the Gonzalez-Woods book: 
www.Prenhall.comy/gonzalezwoods 


which offers additional Support on instructtonal and research toplcS， 


和 Notation 
Equations in the book are typeset Using famillar 下 and (GTreek Symbols， 
as in Flxy)=4sin(ux +2y) and bo = tan [7(u /RU All 
MATLAB function names and Symbols are typeset in monospace font, as 10 
fft2(f), 1ogical(A),and roipoly(f，c，)， 

The first occurrence of a MATILAB or IPT function ls highlighted by use of 
the following lcon on the page marglIn: 





AN、 


< 和 人 faRction name 


Similarly, the first occurrence of a new function developed in the book is high- 
lighted by use of the following lcon on the Page margin: 


functzion name 
人 





The Symbol 一 一 spaes js USsed as 3a Visual cue to denote the end of a function 
listing， 

When referring to keyboard keys, we use bold letters, such as Return and 
Tab. We also use bold letters when referring to items on a compPuter Screen OT 
menu, Such as File and 上 dit. 





和 The MATLAB Working Environment 


jn this section we give a brief overview of some important operational aspects 
of using MATLAB. 


1 The MATLAB Desktop 


The MATLAB deskiop is the main MATLAB application window. As Fig. 1.1 
Shows, the desktop contains five subwindows: the Command Window, the 
Workspace Browser, the Current Directory Window, the Command History 
Window, and one or more Figure Windows, which are shown only when the 
user displays a graphic. 
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IGURE 1.1 The MAILAB desktop and lts principal components， 


The Cormmaaand Windom is where the user types MATLAB commands and 
exXpIessions at the prompt (>>) and where the outputs of those commands are 
displayed. MATLAB defines the workspace as the set of variabjles that the 
User creates in a work session. The Workspace Bromser Shows these varlables 
and some information about them. Double-clicking on a variable in the WorkK- 
space Biowser launches the 4rray Editor which can be used to obtain infor- 
mation and in some instances edit certain Properties of the Varlabie. 

The Current Directory tab above the Workspace tab shows the contents of 
the crret directpry, whose patp is Shown in the CurreHt Directo 六 玫 jidoW. 
For example, in the Windows operating System the path might be as follows: 
CAMMATLABAWork, indicating that directory“Work”1is a Subdirectory oft 
the main directory “MATLAB,”which is instajled in drive CC Cllcking on the 
arrow in the Current Directory Window shows a list of recently used Paths. 
Clicking on the button to the right ofthe window allows the user to change the 
current directory， 


1.7 串 The MATLAB Working Environment 9 


MATLAB uses asearcphn patipn to find M-files and other MATLAB-related 
files, which are organized in directories in the compnuter file System. Any fle 
run in MATLAB must reside in the current directory or in a difectory that 
is on the search path. By default, the files Supplied with MAILAB and 
MathWorks toolboxes are included in the Search path. The easlest way to 
see which directories are on the search bath, or to add or modify a Search 
path, is to Select Set Path from the File menu on the desktop, and then use 
the Set Path dialog box, It js good Practice to add any commonliy used di- 
rectories to the search path to avoid repeatedly having the change the cur- 
rent directory. 

The Comajzaza1zzd 已 istory Wirdom contains a Tecord of the commands a USer 
has entered in the Command Window, including both current and PIevioUs 
MATLAB sessions. Previously entered MATLAB commands can be selected 
and re-executed from the Command History Window by right-clicking on a 
command or Sequence of commands. This action launches a menu from whbich 
to Select various options in addition to executing the commands. TIhis ls a usSe- 
名] feature when experimenting with various commands in a WOIk Sesslon. 


1.7.2 Using the MAILAB Editor to Create M-EFiles 


The MATLAB editor is both a text editor speciallized for creating M-files and 
a graphical MATLAB debugger. The editor can appear in a window by ltse 才 
or it can be asubwindow in the desktop. M-files are denoted by the extenslon 
.m, as in Pixeldup,m.JTJhe MATLAB editor window has numerous pull-down 
mentus for tasks such as Saving, viewing, and debugging files. Because it peT- 
forms Some simple checks and also uses color to differentiate betWeen VarIlOUS 
elements of code, this text editor is recommended as the tool of choice for 
writing and editing M-tunctions.To open the editor,type edit at the Prompt in 
the Command Window. Similarly, typing edit filename at the prompt opens 
the ME-file filename.min an editor window, ready for editing.As noted earjl- 
er, the file must be in the current directory, or in a directory in the search path. 


1.7.3 Getting Help 


The principal way to get help onjine' is to use the MATLAB Peip Bromser， 
opened as a separate window either by clicking on the question mark Symbol 
(?) on the desktop toolbar, or by typing helLpbrowser at the prompt in the 
Command Window. The Help Browser js a Web browser integrated into the 
MATLAB desktop that displays Hypertext Markup Language (HIML) docu- 
ments. The Help Browser consists of two panes, the Ahe 友 mavigator pa1te, used 
to find information，and the _ display pare，used to view the informatlon. 
Self-explanatory tabs on the navigator pane are used to perform a Search. 
For example, help on a specific function is obtained by selecting the Search 
tab, selecting Function Naime as the Search Type, and then typing in the func- 
tion name in the Search for field. It is good practice to open the 吾 eip Browser 





"Use of the term online in this book refers to information,such as help files,available in a local computer 
System not on the Internet， 
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at the beginning of a MATLAB session to have help readily available during 
code development or other MAITLAB taskK. 

Another way to obtain help for a specific function is by typing doc followed 
by the function name at the command prompt. For example,typing doc format 
displays documentation for the function called format in the display pane of 
the Help Browser. This command opens the browser 过 it is not already open， 

M-functions have two types of information that can be displayed by the 
user, The first is called the 万 7 jnre, which contains the function name and a 
one-line description.The second is a block of explanation called the 万 ep text 
pock (these are discussed in detail in Section 2.10.1). Typing help at the 
prompt followed by a function name displays both the 责 1 line and the 再 eip 
text for that function in the Command Window. Occasionally,this information 
can be more up to date than the information in the Help browser because it 1 
extracted directly from the documentation of the M-function in question. TIyP- 
ing 1ookfofr followed by a keyword displays all the 也 1 lines that contain that 
keyword. This function is useful when ljooking for a particular topic without 
knowing the names of applicable functions. For example,typing Jookfor edge 
at the prompt displays all the H1 lines containing that keyword. Because the 
HIl line contains the function name, it then becomes posSible to look at specif- 
ic functions using the other help methods. Typing lookfor edge -all at the 
prompt displays the H1l line of all functions that contain the word edge in ei- 
ther the H1 line or the Help text block.Words that contaln the characters edge 
also are detected. For example, the H1l line of a function containing the word 
polyedge in the Hl line or Heip text would also be displayed. 

Ltis common MATLAB terminology to use the term Apejp page When refer- 
ring to the information about an M-function displayed by any of the preceding 
approaches, excluding lookfor. It is highly recommended that the reader pe- 
come familiar with all these methods for obtaining information because in the 
following chapters we often give only representative Syntax forms for MAT- 
LAB and IPT functions. This is necessary either because of Space 1imitations 
or to avoid deviating from a particujar discussion more than is absolutely nec- 
essSary, In these cases we Simply introduce the syntax required to execute the 
function in the form required at that point. By being comifortabjle with onlline 
search methods,the reader can then explore a function of interest In more de- 
tail with littie effort， 

Finally, the MathWorks' Web site mentioned in Section 1.3 contalins a large 
database of help material， contributed functions, and other resources that 
should be utilized when the online documentation contains insufficlent intor- 
mation about a deslired topic， 


i74 Saving and Retrieving a Work 9esslon 
There are Sevetal ways to save and load an entire work SesSlon (the contents 
of the Workspace Browser) or Selected workspace varilables in MATILAB. Ihe 
Simpjiest is as tollows. 

To save the entire workspace, simply right-click on any blank space in the 
Workspace Browser window and Select Saye Workspace As from the mennu 


that apPears. This opens a directory window that allows naming the fijle and se- 
jecting any tolder in the System in which to Save lit. Ihen SimPply click Saye. Io 
save a Seiected variable from the Workspace, select the variable with a lett 
click and then right-click on the highlighted area. Then select Sayve Selection 
As from the menu that appears. This again opens a window from wplich a fold- 
er can be Selected to Save the varlable. Io select multipie variables, use Shift- 
click or control-cllick in the familiar manner, and then use the procedure just 
described for a single varlable.All files are Saved in double-precislon, binary 
format with the extenslon .mat.TIhese Saved fjles commonjly are referred to as 
M4T7Hjes. For example, a Session named,say mywWork_2003_02_10, would ap- 
pcar as the MATI-Hle mywork_2003_02_10.mat when saved. Similarly, a Saved 
image called final_image (which ls a Single variable in the workspace) will 
appear when Saved as fnal_limage.Daat. 

To ioad saved workspaces and/or varlabjles, left-click on the folder icon on 
the toolbar of the Workspace Browser window. TIhis causes a window to open 
from whlich a folder containing the MAT-files of interest can be Selected. 
Double-clicking on a Selected MAT-file or selecting Open causes the contents 
of the fijle to be restored in the WorKkspace Browser window、. 

It is possible to achleve the same results described in the preceding para- 
graphs bytyping save and Load at the prompt, with the appropriate 下 le names 
and path information. TIhis approach is not as convenient, but it is used when 
formats other than those available in the menu method are required. As an 
eXeIClse, tbe Teader is encouraged to use the Help Browser to learn more 
about these two functions. 





间 人 罗 How References Are Organjized in the Book 


Alireferences in the book are listed in the Bibliography by author and date, as 
in soille [2003]. Most of the background references for the theoretical content 
of the book are from Gonzalez and Woods [2002]. In cases where this is not 
true, the appIoPpriate new references are identified at the point in the discus- 
SIOn Where they are needed. References that are applicable to all chapters, 
such as MATLAB manuals and other general MATILAB references, are So 
ldentified in the Bibliography 
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in addition to a brief introduction to notation and basic MATLAB tools, the material 
Im this chapter emphasizes the importance of a comprehensive prototyping environ- 
ment in the soiution of digital image processing problems. In the foliowing chapter we 
begin to lay the foundation needed to understand IPT functions and introduce a set of 
fundamental Programming concepts that are used throughout the book.The material 
Im Chapters 3 through 12 spans a wide cross section of topics that are in the mainstream 
of digital image Processing applications. However, although the topics covered are var- 
ied, the discussion in those chapters follows the same basic theme of demonstrating 
how combining MATLAB and IPT functions with new code can be used to solve a 
broad Spectrum of Image-processing problems. 


里 Summary 二 1 


民 





Petiez0 


As mentioned in the previous chapter,the Powerthat MAILAB brings to dig- 
ital image Processing 1$ an eXxtensive set of functions for Processing mujtidl- 
mensional arrays of which images (two-dimensional Dumerical arTayS) are a 
Special case. The Image Processing Toolbox (IPT) is a collection of functlons 
that extend the capability of the MATLAB numeric computing environment. 
These functions，and the expressiveness of the MATILAB language, make 
many image-processing Operations easy to wiite in a compact, clear manner， 
thus providing an ideal software prototyping environment for the solution of 
image PIocessing PrIOoblems. In this chapter we introduce the baslics of MAT- 
LAB notation, discuss a number of fundamental IPT properties and functions， 
and introduce Programming concepts that further enhance the power of 匡 工 . 
Thus,the material in this chapter is the foundation for most of the material in 
the remalinder of the book. 





Digital Image 人 Representation 


An image may be defined as a two-dimensional function, 帮 x, y 放 wherexz and 
yarespakial (Plane) coorainates,and the amplitude of 太 at any pair of coordi- 
nates (xz, y) is called the iniensity of the image at that point. The term 8ray /eyel 
is used often to refer to the intensity of monochrome images. Color Images aTe 
formed by a combination of individual 2-D images. For example, in the RGB 
color Systetm, a color image consists of three (fed, green, and blue) individual 
component images. For this reason, mmany of the techniques developed for 
monochrome images can be extended to color Images by pitocessing the three 
component images individually. Color image Processing is treated in detall In 
Chapter 6， 


2.1 昌 Digital Image Representation ”13 


An Image may be continuous with respect to the x- and y-coordinates, and 
also in amplitude. Conyerting Such an Image to digital form requires that the 
coordinates, as well as the amplitude, be digitized. Dilpltizinpg thbhe coordinate 
values js called sarapiing; digltizing the amplitude values ls called guCPLZzCKOP， 
Thus, when x, y, and the amplitude values of fare all finite,dqiscrete quantities， 
we call the image a digital iptage. 


2.1. Coordinate Conventions 


The result of sampjling and quantization ls a matrix of real numbers. We UsSe 
two Principal ways in thls book to represent digital images. Assume that an 
image jx,y) ls sampled so that the resulting image has AM rows and N 
columns, We say that the lmage js of size M X N.The values of the coordi- 
nates (X, y) are discrete quantitles. For notational clarity and convenience, We 
Use integer values for these discrete coordinates. In many image ProceSsing 
books,the image origin is defined to be at (xz, y) = 〈0,0). The next coordinate 
values along the first row of the image are (xz, y) = (0,1). Itis important to 
keep in mind that the notation (0, 1) is used to Signify the second sample along 
the first row. lt does not mean that these are the actual values of physical co- 
ordinates when the image was Sarmpled. Figure 2.1(a) Shows this coordinate 
convention. Note thatx ranges from0to AM -landyfromb0toA 一 1iiinin- 
tegeT IDCTeIments. 

The coordinate convention used in the toolbox to denote arrays ls different 
from the Preceding paragraph in two minor ways. First, instead of using (X，y)， 
the toolbox uses the notation {(, cj) to indicate TOws and colaumns. Note, how- 
ever, that the order of coordinates ls the same as the order discussed in the 
previous paragraph, in the sense that the first element of a coordinate tuple， 
(a, bj, refers to arIow and the second to a column. The other difference is that 
the origin of the coordinate System is at (mc) = (1 1);thusrranges from 1 to 
hand c from lto Ain integer increments. This coordinate convention is 
shown in Fig.2.1(b). 
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IPT documentation refers to the coordinates in Fig. 2.1(b) as Pixel coordi- 
jates. Less frequently, the toolbox also employs another coordinate conven- 
tion called spatial coordinatesy, which uses x to refer to columns and y to refers 
to rOws. This is the opposite of our use of variables x and y. With very few exX- 
ceptions, we do not use IPT's spatial coordinate convention in thls book, but 
the reader wil definitely encounter the terminology in IPT documentation， 


.1.2 IJmages as Matrices 
The coordinate system in Fig. 2.1(a) and the preceding discussion lead to the 
following representation for a digitized image function: 
太 0, 0) 太 0, ]) 
0 


(ON 一 了 
/ta 力 = ee 


7(CM 三 0 下 凡 一 瑟 蕊 (人 
The right side of this equation is a digital image by definition. Each element of 
this array is called an ipzage ele1aembl Dicture elemtenb Pixel oT pel The terms 
jzapge and pixrel are used throughout the rest of our discusslons to denote 3 
dipgltal image and lts elements. 

Adigital image can be rebpresented naturally as a MATLAB matrix: 


f(1，1) Ti，2) f(1，N) 
| 
f(W 1) TO 2) … ff(W N 


where f(1，1) = 帮 (0,0) (note the use of a mmonospace font to denote MATI- 
LAB quantities). Clearjy the two representations are identical, except for the 
shift in origin. The notation f(p, q) denotes the element located in row p and 
column q. For example,f(6, 2) isthe element in the slxth Tow and second col]- 
umn of the matrix f, Typicaly we use the jetters M and N, respectivejly, to de- 
note the number of rows and columns in a matrix.A 1xNmatrix ls called a 
rom veclor whereas an Mx1matrix is called a coxa1 yector AT1Tx1matrix is 
3.3CLLQ1. 

Matrices in MATLAB are stored in varlables with names Such as A, a, RGB， 
real_array, and So on,. Variables must begin with a letter and contain only 
letters, numerals, and underscores. As noted in the PTrevious paragraph, all 
MATLAB quantities in this book are written using monospace characters. We 
Use conventional Roman, italic notation, Such as A(x, y),， for mathematical 
eXPIeSSions. 





儿 生 阁 Reading Images 


Images are read into the MATILAB environment using function imread， 
whose Syntax 1S 


imread( 'filenanmne  ) 


en 


ne 
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Formtat 


Recognized 
Name 尖 


也 Xtenmsions 


Description 


的 页 
, jpg, .jpeg 
.gif 
.bmp 
.png 
.XWd 


Tagged Image File Format 

Joint Photographic Experts Group 
Graphics interchange Formatt 
Windows Bitmap 

Portable Network Graphics 

久 Window Dump 





1GIF is supported by imread,but not by imwrite， 


Here, filename is a String containing the complete name of the image file (in- 
cluding any appllicable extension). For example, the comazzza1zd Le 


>>f= imread( chestxray.jpg ); 


reads the JPEG (Iable 2.1) image chestxray into image array f.Note the use 
of single quotes (' ) to delimit the string filename.The semicolon at the end 
of a command line ls used by MAITLAB forsxppressing output. If a semicolon 
iS not included, MAITLAB displays the results of the operation(s) specified in 
that jine. The prompt Symbol (>>) designates the beginning of a command line， 
as it apPpears in the MATLAB Command Window (see Fig. 1.1)， 

When, as in the preceding command line,no path information is included in 
filename, imread reads the file fronm the current directory (see Section 1.7.1) 
and, if that fails, it tries to find the fle in the MATILAB search Path (see 
Section 1.7.1). The simplest way to read an image from a specified directory is 
to include a full or relative path to that directory in filename. For example， 


>>Tf = imnread('D:\myinagesvchestxray ,jpg'); 
reads the Image from a folder called myimages on the D: drive, whereaas 
>> 地 = imread(' ,VmyinmagesVvchestxray.jpg'); 


reads the image from the myimages subdirectory of the current Working di- 
IectoIy [The Current Directory Window on the MATLAB desktop toolbar 
displays MATLAB's current working directory and provides a simple, man- 
ual way to change it. Table 2.1 lists some of the most popujlar Image/graphics 
formats supported by imread and imwrite (imwrite is discussed in 
Section 2.4). 


Function size gives the row and column dimensions of an Image: 


>> Size(f) 
ans = 
1024 1024 


TABLE 2.1 

Some of the 
image/graphics 
formats Supported 
by imread and 
Inmwrite, Starting 
with MATLAB 6.4. 
Earlier Versions 
Support a Subset of 
these formats. See 
onjine help for a 
compjete list of 
Supported formats. 
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This function is particularly useful in programming when used In the following 
form to determine automatically the Size of an Image: 


>> [M，N] = Size(T) 


This syntax returns the number of rows (M) and columans (N) ip the image， 
The whos function displays additional information about an arTay. For in- 
stance, the statement 


>> Whos 二 
glVeS 
Name Size Bytes C1LasSS 
f 1024x1024 1048576 uint8 array 


Grand total is 1048576 elements USing 1048576 bytesS 


The uint8 entry Shown refers to one of Several MAILAB data classes dis- 
cussed in Section 2.5. A semicolon at the end of a whos line has no effect, SO 


norrmally one is not used. 





必 虽 Displaying Images 
Images are displayed on the MATLAB desktop using function imshow， whlch 
has the baslic Syntax: 


imshowf(f，G) 


where fis an image array, and G js the number of intensity levels used to dis- 
playit.If Gis omitted,it defaults to 256 levels. Uslng the Syntax 
imshow(f，[f1Low high] ) 
displjays as black all values less than OF equal to 10w， and as White all values 
greater than or equal to high, TIhe values in betwWeen aTe dispjayed as interme- 
diate intensity values using the default number of levejls. Finally, the SyntaxX 
imshow(f，[ ]) 

sets variable 1ow to the minimum value of array f and high to its maximum 
value. This form of imshow is useful for displaying images that have a ljow dy- 
namic range or that have Positive and negative Values. 

Function pixval is used frequentily to display the intensity Values of indL- 
vidual pixels interactively. This function displays a CUISOT overTlalid on an 


image, As the cutsor is moved over the image with the muUsSe， the coordi- 
nates of the_ cursor Position and the corresponding intensity Values are 


和 和 二 提 


可 
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shown on a display that apPpears below the 和 gure window. When working 
with color images, the coordinates as wejl as the red, green, and blue compo- 
nents are displayed. If the left button on the mouse 1s clicked and then held 
pressed, pixval displays the Euclidean distance between the inltial and curT- 
rentt cUISOT ]ocations. 

The syntax form of interest here is 


pPIXVal 





which shows the cursor on the last Image displayed. Clicking the 入 button on 
the cuIsor window turns lt Oo 不 . 


下 和 AMP 2.1: 
Image Teading 
and dispiaying. 


疆 (a) TIhe following statements read from disk an image called rose_512.tif， 
extract basic information about the imnage, and display lt using imshow: 


>> 千 = imread( rose 5t2.tlif ) 
>> Whos 于 


Name Size BytesS C1Lass 
f 512X512 262144 UInt8 array 
Grand total is 262144 elementSs USing 262144 byteS 


>> Imshow(Tf ) 


Asemicolon at the end of an imshow line has no eftect, So Dormally one 1S 
not used.Figure 2.2 Shows what the output ljooks like on the Screen. [he fgure 
number appPears on the top, jett of the window. Note the various pulj-down 
menus and utility buttons,. They are used for Processes Such as scaling, Savling， 
and exporting the contents of the display window. In Particujar, the 了 djit menu 
has functions for editing and formatting restults before they are Printed or 
Saved to disK， 


FIGURE 2.2 
Screen capture 
Showing how an 
Image appears on 
the MATLAB 
desktop. 

有 Oowever, in most 
of the examples 
throughonut thbls 
book, only the 
Images 
themselves are 
shown. Note the 
上 gure number on 
the top,lett Part 
of the window. 
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If another image, g, is displayed using imshow、 MAILAB replaces the 
image in the screen with the new image. To keep the first image and output a 
second image, we use function figure as follows: 


>> figure，imshow(g) 
Using the statement 
>> jmshow(f)，figure，imshow(g) 


displays both imnages. Note that more than one command can be written on 3 
line,as iong as different commands are properly delimited by commas or Seml- 
colons.As mentioned earjlier, a semicolon is used whenever it is desired to SUP- 
press Screen outputs from a command line. 

(b) Suppose that we have just read an image h and find that using imshow(h) 
produces the image in Fig. 2.3(aj. It is clear that this image has a ljow dynamllc 
range, which can be remedied for display purposes by uslng the Statement 


>> imSshow(th，({ ]) 


Figure 2.3(b) shows the result.The improvement is apPparent. 是 


Writing Images 
Images are written to disk using function imwrite, whbich has the following 
baslc Syntax: 


LImwritelftf， ' filename ) 


With this syntax, the string contained in filename mtust inciude a Tecognlzed 
file format extension (see Table 2.1). Alternatively, the desired format can be 
specified explicitly with a third input argument. For exampje, the followling 
comimand writes fto aTIFF fle named patient10_run1: 


>> imwrite(f， ' patient10_run1 ， 


or alternatively， 


>> imwrite(f，'patient10_runf .tif ) 





0 
Do eye 
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Iffilename contains no path information, then imwrite Saves the file in the 
current Working directory， 

The imwrite function can have other parameters, depending on the flle for- 
mat selected. Most of the work in the following chapters deals either with 
JPEG orTIFF images, So we focus attention here on these two formats. 

A more general imwrite Syntax applicabje only to JPEG lmages 1S 


imwrite(f， 'filenane,jpg' ，'quality' ，q) 


whbere qis an integer between 0 and 100 (the ljower the number the higher tbe 
degradation due to JPEG compression)， 


盟 Figure 2.4(a) Shows an image, f, tyPical of Sequences of Images resulting 
from a given chemical PTocess. It is desired to transmit these Images on a foOU- 
tine basis to a central site for visual and/or automated Inspection. In order to 


_ reduce storage and transmission time, it is important that the images be com- 


Pressed as much as possible while not degrading their visual appearance 
beyond a reasonable level, In this case“Teasonable”means no Perceptible 
false contouring. Figures 2.4(b) through (人 龟 show the results obtained by writ- 
ing image fto disk (in JPEG format), with q = 50, 25, 15, 5, and 0, respective- 
ly For example,forgq= 25 tbhe applicable syntax 1S 


>> imwrite(f， "bubbles25.jpg' ， :quality ' ，25) 


The image forq= 15 [Fig.2.4(d)] has false contouring that is barely visible， 


but this effect becomes quite Pronounced for q9= 5 and 9q = 0.Thus an 


acceptabjle solution with some margin for erTor ls to compress the Images with 
q= 25. In order to get an idea of the compression achieved and to obtain other 
Image fle details, we can use function imfinfo, which has the Syntax 


imfinfo filename 


where filename is the complete file name of the image stored in disk. For 
exXample， 


>> imfinfo bubb1Les25.jpg 


outputs the following intormation (note that some fields contain no informa- 
ton ln thls case): 


FI1Lename : bubb1es25.jpg， 
FileModDate : 04-Jan-2003 12:31:26， 
FilIeSize: 13849 
Fornmat : “jpg 
FormatVersion : 2 
Width: 714 
Height: 682 
BitDepth: 8 
CoLorTYpe : OPFayscale， 
FormatSignature : 


Comment: 1{} 


2 


了 上 XAMPLE 2.2: 
Writing an imnasge 
and Using 
function imfinfo. 
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where FileSize is in bytes.The number of bytes in the original image is com- 
puted Simply by multiplying Width by Height by BitDepth and dividing the 
Tesujt by 8. Ihe resujt is 486948. Dividing this by FileSize gives the comPres- 
slon ratio: (486948/13849) = 35.16. This_ compression ratio was achieved 
whblie maintaining image quajlity consistent with the requirements of the appli- 
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cation. In addition to the obvlious advantages in Storage Space, thls reduction 
allows the transmlssion of approxlmately 39 times the amount of uncom- 
pressed data per Unlt time， 

The information fields displayed by imfinfo can be captured into 3a So- 
called sirzctre variaplie that can be used for subsequent computations. Using 
the Preceding Image as an exampie, and assigning the name K to the structure 
Variabje, we Use the SyntaX 


>> K = imfinfo( bubbles25.jpg' ) ; 


to store lnto varlable K all the information generated by command imfinfo. 


The information generated by imfinfo 1s appended to the Structure variabjle 


by means of jelds, separated from K by a dot, For example, the Image height 
and width are now stored in Structure fieldsK.Height andK.Width. 

Asanilustration, conslder the following use of structure Variable K to com- 
Pute the compresslon ratio for bubpb1es25 .jpg: 


>> K = ImfInfo( bubbles25 ,jpg ) ; 

>> jimage_bytes = KK.Width*K.Height*KkK.BitDepthy/8; 

>> Compressed _ bytes = KK,FlI1eSzzei 

>> Compression_ratio = image_bytes/compressed _ bytes 


compression_ratio = 
35.1612 


Note that imfinfo was Used in two different waySs. [he first was to type 
imfinfo bubbles25.jpg at the prompt，whlich resujted in the information 
beling displayed on the Screen, IJhe second was to type K= imfinfol( bub- 
bles25,.jpg' )，whlch resujted ln the information generated by imfinfo 
being stored in K.IThese two different ways of calling imfinfo are an exampPple 
of co7az11z412d-Hfaction duality, an Important concebt that is explained in more 
detail in the MATLAB online documentation. 圳 


A more general imwrite Syntax appPlicabjle onlyto tif images has the form 


LImwrite(g， 'filename.tif'，':comnpression'， parameter ' ，,，,， 
resolution ，f[coLlres Fowres] ) 


where 'parameter' can have one of the following Principal values: none 
indicates no _ compression; 'packbits' indicates packbits compression (the 
default for nonbinary images); and 'ccitt' indicates ccitt compression (the 
defauit for binary images).The1l X 2array [colres rowresj contains two in- 
tegers that glve the column resolution and Tow resoiution in dots-per-unilt (the 
default values are [72 72]). For example, 这 the image dimensions are in inches， 
colres is the number of dots (Pixels) per inch (dpi) in the vertical direction， 
and similarly for rowres in the horizontal direction. Specifying the resolution 
by asingle scalar, res,is eqtivalent to writing [res res]. 
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油 Figure 2.53(a) is an 8-bit -ray image of a circuit board generated during 
quajlity Inspection. It ls in jpg format，at 200 dpi, The image is of Size 
430 X 450 Pixejls, So lts dimensions are 2.23 X 2.29 inches. We want to Store 
thls image in tif format, with no compression,under the name sf.In addition， 
we want to reduce the Slze of the imasge to 1.3 X 1.9 inches while keeping the 
pixel count at 450 X 450. Jhe following statement yields the desired resujt: 


>> jimwrite(f，sf.tif'， compression'，none' ，fPresSolution' ，,,， 
[300 300] ) 


The values of the vector [colres rowres] were determined by multiplying 
200 dpi by the ratio 2.25/1.5, which gives 300 dpi, Rather than do the compu- 
tation manuaily, we could write 


>> Pres = round(200*2.25/11.5); 

>> :imwrite(f，'Sf.tif'， "compression'，'none' ，'presolution' ，res) 
where function round rounds its argument to the nearest integer. It is impor- 
tant to note that the number of pixels was not changed by these commands. 
Onjy the scale of the image changed,The original 43$0 X 450 image at 200 dpi 
ls of Size 2.23 X 2.25 inches.The new 300-dpi image is identical, except that its 
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450 X 450 pixejls are distributed over a 1.3 X 1.3-inch area. Processes Such as 
this are useful for controlling the Size of an image in a Printed document with- 
out Sacrificing resSolutlon . 峡 


Often, it 1S necessaIy to export 1mages to disk the way they appear on the 
MATLAB desktop. Ihis 1s especially true with plots, as shown iD the next 
chapter, The comtem8 of a fgure window can be exported to disk in two ways. 
The first ls to use the File pull-down mena in the figure window (see Fig.2.2) 
and then choose 了 xport. With thbis Option, the User can select 3a location, file 
name, and tormat. More control over export parameters 1s obtained by using 
the print command: 


print -fno -dfiJIerornat -rresno filename 


where no refers to the figure number in the figure window of interest， 
FiJerformat refers to one of the file formats in Table 2.1, resno is the resolu- 
tion in dpl, and filename ls the name we wish to asslgn the file. For example， 
to exXport the contents of the figure window in Fig.2.2 as a tif file at 300 dpij， 
and Under the name hi_res_rose, we Would type 


>> print -fi1 -dtiff -~-r300 hi_res_rose 


This command sends the file hi_res_rose.tif tothe current directory. 

开 we Simply type print at the prompt, MATLAB prints (to the default 
Printer) the contents of the last figure window displayed. It is possible also to 
specify other options with print, such as a Specific printing device. 





Data Classes 


Aithough we work with integer coordinates, the values of pixels themselves are 
not restricted to be integers in MATLAB.Table 2.2 lists the various daia classey! 
Supported by MAILAB and IPT for representing Pixel values. The first eight 
entries in the table are referred to as Httrmaeric data classes.The ninth entry is the 
canar class and, as shown, the last entry is referred to as the Logical data class. 

Al zarrmeric computations in MATLAB are done using double quantities， 
So thls is also a frequent data class encountered in image processing applica- 
tions, Class uint8 also is encountered frequently especially when reading 
data from storage devices, as 8-bit images are the most common representa- 
tions found in practice, These two data classes, class 1ogical,and, to a lesser 
degree, class Uint16, constitute the prImary data classes on which we focus in 
this book. Many IPT functions, however, support all the data classes listed in 
Table 2.2. Data class double requires 8 bytes to represent a number, Uint8 
and int8 require 1 byte each,uint16 and int16 require 2 bytes, and uint32， 


ee 


MATLAB documentation often uses the terms cata class and claia MDe interchangeably In this book， 
We TIesServe use of the'term type for images.as discussed in Section 2.6. 
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TABLE 2.2 

Data classes. The 
first eight entrles 
are referred to as 
1ILIEHIC Cl]asseS， 
the nlinth entry 1S 
the cAaracter 
class, and the last 
entry 1S of class 
LO8FCaL 


Description 


Double-precision, floating-point numbers in the approximate 
range 一 10308 to 1030 (8 bytes per element). 

Unsigned 8-bit integers in the range [0,255] (1 byte Per eljement). 
Unsigned 16-bit integers in the Tange [0, 65535] (2 bytes Per 
eljement), 

Unsigned 32-bit integers in the range [0, 4294967295] (4 bytes 
per element). 

Signed 8-bit integers in the range [一 128, 127] (1 byte per element)， 
Signed 16-bit integers in the range [一 32768, 32767j (2 bytes Per 
element). 

Signed 32-bit integers in the range [一 2147483648, 2147483647] 

(4 bytes per eljement). 

Single-precision floating-point numbers with values in the 
approximate range 一 103 to 103 (4 bytes per element). 
Characters (2 bytes per element). 

Values are0or 1 (1 byte per element). 


doub1Le 


ULILnt8 
ULnt16 


UILnt32 


Int8 
Int16 


1Lnt32 


SingLlLe 


char 
10gGICal 





int32,and single,require 4bytes each.TIhe char data class holds characters 
in Unicode representation.A chnaracter string is merelya1xnarray of char- 
acters.A 1ogical array contains onjly the values 0 and 1, with each element 
being stored in memory using one byte per element, Logical arTrayS are CTeat- 
ed by using function 1ogical (See Section 2.6.2) or by Using rejational opera- 
tors (Section 2.10.2)， 





Image [Types 
The toolbox Supports four types of Images: 


e。 JIntensity images 
e。 Binary images 
。 Indexed images 
。 RGB Images 


Most monochrome image processing Operations are carried out using blIinary 
or intensity images, SO OUI initial focus is on these two image types. Indexed 
and RGB color images are discussed in Chapter 6. 


.9 JIntensity Images 

An iniensity ;inage is a data matrix whose values have been Scaled to TePTesent 
intensijties. When the elements of an intenslty iage are of class Uint8，OT 
class Uint16, they have integer values in the Iange [0,235] and [0,65533], re- 
spectively. If the image is of class double, the values are floating-polint num- 
bers. Values of scaled, class double intensity images are in the range [0,1] by 
convention， 
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2.6.2 Binary Images 

Binary images have a YeIy Specific meaning in MATLAB.A pinary na8e 1S 
a logical array of 0s and 1S. IThus, an array of 0s and Is whose Values are off 
data class, say uint8, js not considered a binary image In MAILAB, A 
numeric arTay is converted to binary using function 10gical. Thus,ifAis a 
numeric array consjsting of 0s and 1S, we create 3 logical arTray B USIDg the 


statement 
B = 10gical(A) 


If A contains elements other than 0s and ls, use of the logical functlon con- 
verts all nonZero quantities to logical 1s and all entries with value 0 to lo8gical 
0s. Using relational and logical operators (See Section 2.10.2) also crTeates logi- 


cal afrTayS. 
To test 过 an array js logical we use the is10gical functiont: 


IsSlogical(C) 


Ifcis alogical array,this function returns a 1. Otherwise it returns a 0.Loglcal 
arrays can be converted to numeric arrays using the data class coOnyeISlon 
functions discussed in Section 2.7.1. 


26.3 A Note on TIerminojlogy 


Consjderable care was taken in the PIeVvious two sections to clarify the use of 
the terms data ciass and iazage type.In general, we referto an image as beling a 
“data class image_type image”where data_class is one of the entrles 
from Table 2.2, and image_type is one of tpe image types defined at the begim- 
ning of this section. Thus, an image is characterized by potpP a class G1d a type. 
For instance, a statement discussing an “unit8 intensity image”1S SImP1y Fe- 
ferring to an intensity Image whose Pixels are of data class unit8. Some func- 
tions in the toolbox Support all data classes, while others are veIy Specific as to 
what constitutes a valid class. For example, the Pixels in a binary image can 
only be of data class 1ogical,as mentioned earjlier. 





区 发 入 Converting between Data Classes and Image Types 


Conyverting between data classes and image types is a trequent operation in 
IPT applications. When converting between data classes, it is Important to 
keep in mind the value ranges for each data ciass detailed in Table 2.2. 


.7. Converting between Data Classes 
Converting between data classes is straightforward.The general syntax is 


B = data class_nanme(A) 


where data_class_name is one of the names in the first column of Iable 2.2. 
For example, Suppose that Ais an array of class uint8. A _ double-precision 
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arTay, B, ls generated by the command B = doub1le(A).This conversion ls Used 
routinely throughout the book because MAILAB expects operands in Pu- 
merical computations to be doubjle-precislon,floating-point numbers. ICis an 
array of class double in which all values are in the range [0,255] (but possibly 
containing fractional valuesj, lt can be conyverted to an Uint8 array with the 
commandD = Uint8(C)， 

If an array of class double has any values outside the range [0,255] and it 1s 
converted to class uint8 in the manner just described, MAILAB converts to 
0 all values that are jess tpan 0, and converts to 259 all values that are greater 
than 235. Numbers in between are converted to integers by discarding their 
fractional parts. Thus, proper scaling of a double array So that its elements are 
in the range 10,255] is necessary before converting it to Uint8,Asindicated in 
Sectlon 2.0.2, converting any of the numerlc data classes to 10gical resujts in 
an aITay with joglcajl ls In locations where the inmput arTay had nonzero values， 
and logical 0s in places where the inpnut array contained 0s. 


.7.2 Converting between Image Classes and Types 


The toolbox provides Specific functions (Table 2.3) that perform the scaling 
neceSsary to Convert between lmage classes and types,.Function im2uint8 de- 
tects the data class of the input and performs all the necessary scaling for the 
tooljpox to recognlze the data as valid image data. For exXample, conslder the 
following2 x 2imagefofclass double,which could be the result of an inter- 
mediate computation: 


Performing the converslon 
>> 9 = im2uUInt8(f) 
yields the result 
g = 


0 128 
19] 255 


Yajlid Input mage Data Ciasses 


Coliyerts Input to: 


ULnt8 10gical,uint8,uint16,and double 
Uint16 
double (in range [0, 可 ) 
doub]le 


10g1Cal 


Im2uint8 
im2uint16 10gical,uint8,uint16,and double 
doub1】e 

10g9ical.uUint8, uint16,and doub1le 


Unt8,uUint16.and doub1e 


站 at2gray 
Im2doubjle 


工 2bw 
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from which we see that functlon im2uint8 sets to 0 all values in the input that 
are less than 0U, Sets to 253 all values in the imput that are greater than 1, and 
multiplies all other values by 235.Rounding the results of the multiplication to 
the nearest Integer compjletes the conversion. Note that the rounding behavior 
of im2uint8 ls different from the data-class convyersion function uint8 dis- 
cussed in the pfevlous Section, which simply discards fractional parts. 

Conyverting an arbltrary arTay of class double to an array of class doub1le 
ycaled to the range [0, 1] can be accomplished by using function mat2gray 
whose baslc SYyntaxX 1S 


g = mat2gray(A，[Amin，Amax]j) 


where image g has values in the range 0 (black) to 1 (white). The specified pa- 
Tameters Amin and Amax are Such that values less than Aminin Abecome 0in g， 
and values greater than Amax in A correspond to 1 in g. Writing 


>> g = nat2gray(A) 


sets the values of Amin and Amax to the actual minimum and maximurm values in 
A.Ihe Input ls assumed to be of class doub1le.The output also is of class doub1le. 

Function im2doub1le conyverts an input to class double. If the input is of 
class Uint8，uint16, or 10gical, function im2double converts it to _ class 
double with values in the range [0, 1].Itthe input is already of class doub1le， 
im2double returns an arTay that 1S equajl to the input. For example,if an array 
of class double results from computations that yield values outside the range 
[0 ,inputting this array into im2doub1e w 刘 have no effect.As mentioned in 
the preceding paragraph, a double array having arbitrary values can be con- 
verted to a double array with values in the range [0, 1] by using function 
mat2gray， 

As an illustration, consider the class uint8 imape' 


>> h = Uint8([25 50; 128 200]) ; 
Perfcrming the conversion 

>> g = im2doublel(h) ; 

yields the resuljt 

g = 


0.0980 
0.4706 


0.1961 
0.7843 


from which we infer that the conversion when the Input Is of class uint8 is 
done simply by dividing each value of the input array by 233. If the input is of 
class uint16 the division is by 6S535. 


ea 


Section 2.8.2 explains the use of sduare brackets and senlicolons to specity a matrix. 
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Finajly, we consider converIslon between blnary and intenslty lmage types. 
Function im2bw, which has the Syntax 


g = Im2bw(f，FT) 


produces a binary Image, g, from an intensSlty Image,f, by thresholqing. TIhe 
output binary timage g has values of 0 for al pixels in the input Image with 
intenslty values less than threshold T, and 1 for al other Pixels. TIhe value 
specified for T has to be in the range [0, 1j, regardless of the class of the 
input. TIhe output binary Image ls automatically declared as a Logical arTay 
by im2bw. Iwe wiiteg= inm2bw(f),IPTIuses a defaujt value of 0.3 for T. 于 
the input is an Uint8 Image, im2bw dlvides all its PiXejls by 2355 and then ap- 
Plies either the default or a specified threshold. I the input 1 of class 
uint16, the divislon IlS by 065533. If the input is a doub1e lmage, im2bw ap- 
plies either the default or a specified threshold directly. 开 the input 1 a 
10gical array, the output ls identical to the input.A logical (binary) array 
can be converted to a numerical arTay by using any of the four functions in 
the frst column of Jabjle 2.3， 


 (a) We wish to convert the following double image 


>>f= [12;34] 
一 


] 2 
3 4 


to binary Such that values 1 and 2 become 0 and the other two values become 
1. First we convert it to the range [0, 1]: 


>> 9 = mat2gray(f) 
日 = 


0 0.3333 
0.6667 1,0000 


Then we convert lt to blinary using a thbhreshold, say, of value 0.6: 


>> gb = im2bw(g，0.6) 
gb = 


0 0 
1 1 


村 


国光 


:0 


0.0039 
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As mentioned in Section 2.5, we can generate a bljnary arITray directiy using re- 
lational operators (Section 2.10.2). Thus we get the Samie result by writing 


5 DOE 
gb = 


0 0 
| 1 


We could store in a variable (Say, gbv) the fact that gb is a logical array by 
using the 1s10gical function, as follows: 


>> gbv = 1S1Logical(gb) 
gbv = 
1 


(b) Suppose noOw that we want to convert gb to a numerical array of 0s and 
ls of class doub1le.This ls done directly: 


>> gbd = :im2double(gb) 
gbd = 
0 0 
1 


Hgb had been a numeric array of class Uint8, apPlying im2double to it 
would have resulted in an array with values 


9 0 
0.0039 


because im2double woujd have divided all the elements by 255. This did not 
happen in the preceding conversion because im2double detected that the 
input was a logical array, whose only possible values are 0 and 1. Ifthe input 
in fact had been an uint8 numeric array and we wanted to convert it to class 
double while Keeping the 0 and 1 values, we woujd have converted the array 
by writing 


>> gbd = doub1le(gb) 
gbd = 
0 0 
] 
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Finally we Point out that MAILAB supports nested statements, So We could have 和 To access piocks of elements we use MATILAB's colomr notation. For exam- 
started with inagefandarrved at the same resultby using the one-line statement 了 ple, to access the first three elements of v we write 
>> gbd = Im2doub1lel(im2bwf(mat2gray(f)，0.6) ) |; 吉 >> V(1T3:3) coier 5 
or by using partial groupings of these functions, Of course, the entire process Se 
could have been done in this case with a Simpler command: 浊 二 汪汪 
>> gbd = double(f > 2) 了 Similarly we can access the second through the fourth elements 
again demonstrating the compactness of the MATLAB language. ， >> v(2:4) 
er 汪 ans = 
区 发 昌 Array Indexing ER 
MAILAB supports a number of powerful indexing Schemes that Simplify 
array manipulation and ImPpTove the efficiency of programs. In this Section We orall the elements from, say, the third through the last element: 
discuss and justrate basic indexing in one and two dimensions (i.e., vectors 攻 
and Imatrices). More Sophisticated techniques are introduced as needed in sup- 沸 >> v(3:end) 本 
Sequent discusslonsS. 直 . NA 
旭 ans = ， 
汪 5 7 9 


“加 .j Vector Indexing 


As discussed in Section 2.1.2, an arTay of dimension 1 X Nis calledarowyvec- 
IO 六 The elements of Such a vector are accessed Using one-dimensional index- 间 本 
ing.Thus,v(1) is the first element of vector v,v(2) its second element andso 沸 
forth. TIhe elements of vectors in MATILAB are enclosed by sqtare brackets 
and are Separated by Spaces or by commas. For example， 





where end signifies the last element in the vector. If v is a vector, writing 


>> V1{:) 





produces a column vector, Whereas Writing 





>>v= [13579] 汪 
量 >> v(1:end) 
>> V{(2) 和 Produces a TOowW vector. 
让 和 Jndexing is not restricted to contiguous elements. For example， 
3 证 
奸 >> v(1:2:end) 
A Iow vector 1$ conyerted to a column vector Using the tra1spose Operator (，): 法 ans = 
>> 骨 = V, 于 0 
w = 了 
singasingle guote 1 浊 The 1:2:end saysto start at 1,countup by2 and stop when the count 
浊 Teache Ta 
jjpout the period 3 】 s the last element. The steps can be negative: 
coOPtPEieS 雪 e coFjz- 汗 . 
ze rra1yDO5e 环 Pen 己 >> V(end':-2 :| ) 
edata dre redl po 纺 如 
TSDOSes Ca1t pe 7 时 ans = 
ed iierchartgeaby 9 奸 
oee Tzpie 2 和 尘 9 5 1 
副 
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Here,the index count started at the last element, decreased by 2, and stopped 
when it reached the farst element. 
Function Linspace, With SyntaxX 


X = Linspace(a，b，Dn) 


generates a Tow vector x of n eljements linearly Spaced between and including 
a and b. We use this function in several Places in later chapters. 

A vector can even be used as an index into another vector, For example, we 
can pick the first, fourth, and fifth eljements of v using the command 


>> vt([1 4 5]) 
ans 三 
1 7 9 


As Shown in the following section, thbe ability to use a vector as an index Ilnto 
another vector also plays akey role in matrix Indexing. 


.3.2 Matrix Indexing 


Matrices can be represented conveniently in MATLAB as a sequence of roW 
vectors enclosed by square brackets and separated by Semlcolons. For eXamm- 


ple, typing 
>>A= [123;456;789] 


displays the3 X 3matrix 


A = 
1 2 3 
人 4 5 6 
7 8 纪 


Note thatthe use of semicolons here is different from their use mentloned ear- 
lier to suppress output or to write mujltiple commands in a single line. 

We seject elements in a matrix just as we did for vectors, but now we need 
two indices: one to establish a row location and the other for the correspond- 
ing column, For example, to extract the element in the second row, third coj- 
Umn, we wiite 


>> AL(2，3) 
ans = 
6 


| 


”. 测 
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The colon operator 1$S used in matrix indexing to select a two-dimenslional 
block of elements out of a matrix, For example， 


>> C3 = A(:，3) 
C3 = 


3 
6 
9 


Here,use of the colon by ltsejf ls analogous to writing A(1:3,3),which simply 
picks the third column of the matrix. Similarly, we extract the Second row as 
follows: 


>> R2 = A(2，:) 
R2 = 
4 5 6 


Jhe following statement extracts the top two TOowS: 


>> JTJ2 = Al(1:2，13:3) 
Te = 


Jo create a matrix B equal to Abut with its last column set to 0s, we write 


>> B = A; 

>> B(:，3) =0 

B = 
] 2 0 
4 5 0 
7 8 0 


Operations Using end are carried out in a manner Simjlar to the examples 
外 ven ln the Previous section for vector indexing.The following exampPles iilus- 
trate this. 


>> A(end，end) 
ans = 
9 


4 
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>> Al(end，end 一 2 ) 


ans = 


7 
>> A(2:end，end:~21:1) 


ans 
人 4 
昌 7 


Using vectors to index into a matrix provides a Powerful apProach for ele- 
ment Selection. or example， 


>> E= Al 3，!2< 3 


8 9 


The notation A([a bl,[c dj) picks out the elements In A with coordinates 
(row a, column cj, (row a, colummn d), (row b, column cj, and (TOw b, column 
d).Thus, when we letE=Al([1 3]，[2 3]) weareselecting the following ele- 
ments in A: the element in Tow 1 column 2,the element in row 1 column 3,the 
element in Tow 3 column 2, and the element in row 3 columan 3. 

More complex Schemes can be implemented using matrix addressing. A 
particularly useful addressing apProach using matrices for indexing is of the 
form A(D), where Dis a logical array. For exXamPple, 下 


>>D = 10ogical([100;)001;000]) 


:二 
1 0 避 
0 O 1 
0 0 0 
then 
>> A(D) 
ans = 
1 
人 


Finally we point out that use of a single colon as an Index into 3a matrlX Se- 
lects all the elements of the array (on a column-by-columan basis) and arTanges 
them in the form of a column vector.For exampjle, with reference to matriX T2， 


0 
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人 CD ho 上 一 


This use of the colon ls helpful when, for example, we want to find the sum of 
all the elements of a mnatTix: 


>> S= SUm(A(:)) 
S 三 
45 


In general, sum{vV) adds the values of all the elements of input vector v. 于 
amatrix is Input into sum [as in sum(A)],the output is a row vector containing 
the Sums of each individual column of the input array (this behavior is typical 
of many MAILAB functions encountered in later chapters). By using a sin- 
gle colon in the manner just illustrated, we are in reality implementing the 
command 


>> SU(SuUm(A) ) ; 


because use of a Single colon converts the matrix into a vector. 

Using the colon notation is actually a form of jear 加 dexing into a matrix 
or higher-dimensional array. In fact, MATLAB stores each array as a columan 
of values regardless of the actual dimensions.This column consists of the arTay 
columns appended end to end.For example,matrix Ais stored in MATLAB as 


ccoomnhmp、~ 人 人 一 


Accessing A with a single subscript indexes directly into this column. For exam- 
ple， A(3) accesses the third value in the column, the number 7;A(8) accesses 
the eighth value, 6, and so on. When we use the columan noftation, we are Simply 
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下 入 AMPELE 2.5: 
Some Slimple 
Image OPerations 
USIng 3aTT3ay 
indexlng， 


和 四 
让 


和 


FIGURE 2.6 
Resujts obtalined 
Using arTay 
Indexing. 

(aj) Origlinal 
image. (Pb) Image 
flipped vertically. 
(c) Cropped 
Image. 

(d) Subsampied 
image. (e) 人 
horizontal scan 
line through the 
middje of the 


image in (al)， 


addressing al the elements,A(1:end).This type of indexing is a basic Staple in 
vectorizing loops for program optimization, as discussed in Section 2.10.4. 


活 The image in Fig. 2.6(aj is a 1024 X 1024 intensity image,f, of class uint8. 
The image in Fig.2.6(b) was flipped vertically using the Statement 


>> fp = Tf(end:-1:1， 7) 


The image shown in Fig.2.6(c) is a section out of image (a), obtained using 
the command 


>> fc = fT(257:768，257:768) ; 


Similarly，Fig. 2.6(d) shows a subsampled image obtained using the 
statement 


>> fs = fl1:2:end，1:2:end) 
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Finally，Fig. 2.6(e) shows a horizontal Scan line through the Imiddle of 
Fig.2.6(a), obtained using the command 


>> plot(f(512，:)) 
The plot function is discussed in detaijl in section 3.3.1. 庙 


2.8.3 Selecting Array Dimensions 


Operations of the form 
operation(A，dim) 


where operation denotes an applicable MATLAB operation, A is an arTray， 
and dimis ascalar, arte Used frequentiy in this book. For example,SUPPpose that 
Ais an array of size M X N.TIhe command 


>> kK = Size(A，1); 


gives the size of A along its first dimension, which is defined by MAILAB as 
the vertical dimension. TIhat is, this command gives the number of Tows In A. 
Similarly, the second dimension of an array is in the horizontal directlon, SO 
the statement Size(A,2) gives the number of columns in A. A Sin8gielor dr- 
111e13iom is any dimension, dim, for which size(A, dim) = 1. Using these con- 
cepts, we could have written the last command in EXampljle 2.5 3S 


>> plLot(f(Size(f，1)/72，:)) 


MATLAB does not restrict the number of dimensions of an array, SO being 
able to extract the components of an array in any dimension ls an Important 
feature. For the most part, we deal with 2-D arrays, but there are Several in- 
stances (as when working with color or multispectral images) wben it is neces- 
Sary to be able to“stack”images along a third or higher dimension. We deal 
with this in Chapters 6,11,and 12. Function ndims, with syntax 


d= ndims(A) 


gives the number of dimensions of array A. Function ndims never Leturns 8 
Value jless than 2 because even scajafs are considered two dimensional in the 
Sense that they are airfays of SiZe 工 X 工 





和 Some Important Standard Arrays 


Often, it is useful to be ablie to genetate Simple image arrays to try out ideas 


_and to test the syntax of functions during development. In this section we ID- 


troduce Seven array-generating functions that are used in later chapters. 下 
only one argument is included in any of the following functions, the result 1s a 
Square 3aTTay. 
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es。 zeros(M, N) generates anMxNmatrix of 0s of class doub1e. 

e。 ones(M N) generates anMxN matrix of 1s of class doubIe. 

true(M,N) generates an MxN 1ogical matrlix of 1S. 

es。 false(tM,N) generates anMxN1ogical matrx of 0s. 

e。 magic(M) generates an MXxM“maglc Square.”Jhis ls a Square afrray in 
whlich the sum along any Tow, column, or malin diagonal,is the same. Magic 
squares are Useful arrays for testing Purposes because they are easy to 
generate and their numbers are integeTS. 

s。 rand(M, N) generates anMxNIatrix whose entries are Unjformly distrib- 
uted random numbers in the interval [0, 1|]. 

。 randn(M，N) generates anMxNmatrIx whose numbers are norImally dis- 
tributed (Le., Gaussian) random numbers with mean 0 and variance 1 


For exampljle， 


>> A= 5*ones(3，3) 


及 = 
5 冯 . 5 
0 
5 站 
>> magic(3) 
ans := 
8 1 6 
3 7 
4 9 2 
>> B = rand(2，4) 
月 二 二 
0.231] 0.4860 0.7621 0.0185 
0.6068 0.8913 0.4565 0.8214 





AU Introduction to M-EFunction Programming 


One of the most Powerfujl features of the Image Processing Toolbox is ifts 
transparent accegss to the MATILAB programming environment, As will be- 
come evlident shortly MATLAB function programming is flexible and partic- 
ularly easy to iearn， 


.0.1 IM-Eiles 


SO-called AM-jies in MATLAB can be scripts that simply execute a Series of 
MAILAB statements, or they can be functions that can accept arguments and 
can produce one or more outputs. The focus of this section in on M-file func- 
tions. JIhese functions extend the capablilities of both MATLAB and IPT to ad- 
dress Specific, useT-defined apPlications. 


je 


区 


村- 明 
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M-fles are created using a text editor and are stored with a name of the 
form filename.m, SUch as average.m and filter.nm. The components of a 
function M-fue are 


es。 The function deftinition line 
e。 The Hl impe 

es Help text 

。 The function body 

e Comimnents 


The jprctiomr deFprom Ze has the form 
function [outputs] = name(inputs) 


For exampjle, a function to compute the sum and product (two different out- 
puts) of two Images would have the form 


function [s，p] = sumprod(f，g) 


where f, and g are the input images, s is the sum image, and p is the product 
image. The name Sumprod is arbitrarily defined, but the word function always 
appears on the left, in the form shown. Note that the output arguments are en- 
closed by square brackets and the inputs are enclosed by parentheses. if the 
function has a single output argument,it is acceptable to list the argument with- 
out brackets. If the function has no output, only the word function is used， 
without brackets or equal sign. Function names must begin with a letter, and 
the remaining characters can be any combination of letters, numbers and un- 
derscores. No spaces are allowed, MATLAB distinguishes function names up 
to 03 characters long.Additional characters are ignored. 
Functions can be called at the command prompt; for example， 


>> [S，pj = Sumprod(fT，9) | 
or they can be used as elements of other functions, in which case they become 
suDjz7zctions,As noted in the previous paragraph,ifthe output has a Sinpgle ar- 
gument, it is acceptable to write it without the brackets, as 训 
>>yY = Sum(X); 

The Eneis the first text line. Itis a single comazzzent line that follows the 
function definition line.There can be no blank lines or leading spaces between 
the Hl line and the function definition line.An example of an HI line jls 


#% SUMPROD Computes the sum and product of two images ， 


As indicated in Section 1.7.3, the HI1l line is the first text that appears when a 
USer types 


>> help function_ name 


机 

习 人 人 

汉 和 he 二 

3 4、 
人 e P 
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notation 


at the MATLAB prompt. Also, as mentioned in that section, typing 10okfor 
keyword displays all the Hl lines containing the string keyword. TIhls hmne PIO- 
vides important summary information about the Mr-file, so it should be as de- 
Scriptive as Possible. 

elp iexiis a text block that follows the H1l line, without any blank lines in 
between the two. Help text is used to provide comments and online help for 
the function, When a user types help function_name at the Prompt, MAT- 
LAB displays all comment lines that appear between the function definltion 
line and the first noncomment (executable or blank) jine. The help system 1g- 
nores any comment lines that appear after the Help text bjock. 

The 请 mctfiom body contains all the MAILAB code that Perftorms computa- 
tions and assigns Values to output arguments. Several examples of MALLAB 
code are given later in thls chapter， 

All jines preceded by the symbol“s%" that are not the 了 1 line or Help text are 
considered function comzmazertlines and are not considered part of the 再 ejp text 
block.It is permissible to append comments to the end of a line of code. 

M-files can be created and edited using any text editor and saved with the 
extension .min a specified directory, typically in the MATLAB search path. 
Another way to create or edit an Mrfile is to use the edit function at the 
Prompt. For example， 


>> edit Sumprod 


opens for editing the file sumprod .mifthe file exists in a directory that is in the 
MATLAB path or in the current directory If the file cannot be found, MA 
LAB gives the user the option to create it. As noted in Section 1.7.2, the 
MATLAB editor window has numerous pull-down menus for tasks Such as 
saving, viewing, and debugging files. Because it performs some Simple checks 
and uses color to differentiate between various elements of code, this text edl- 
tor is recommended as the tool of choice for writing and editing M-functions. 


z,.10.2 Operators 
MATLAB operators are grouped into three main categories: 


。 Ariithmetic operators that perform numeric computations 
。 Relational operators that compare operands quantitativejy 
。 Logical operators that perform the functions AND,OR,andNOT 


These are discussed in the remainder of this section. 


Arithmetic OPerators 


MATLAB has two different types of arithmetic operations, Matrix QFPpzzaettc 
operafions are defined by the rules of linear algebra. 4rray Ginzaetc opera- 
fioms are carried out element by element and can be used with multidimen- 
sional arrays, The period (dot) character (,) distinguishes array Operations 
from matrix operations. For example,A*B indicates matrix multiplication in the 


traditional sense, whereas A.*B indicates array multiplication,in the sense that 深 ， 
the result is an array, the same Size as A and B, in which each eljement is tbe 昌 
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roduct of corresponding elements of A and B. In other words, iiC = A.*B， 
then C(I,J) =A(I,J)*B(I,J),Because matrix and array operations are the 
same for addition and subtraction, the character Pairs .+ and .- are Dot used. 

When writing an expression Such as B =A,MATLAB makes a “note thatB 
is equal to A, but does not actually copy the data into B unless the contents of 
A change later in the Program, This is an important point because using di 
ferent variables to“store”the same information Sometimes can enhance code 
clarity and readability. Thus, the fact that MATLAB does not duplicate infor- 
mation unless it is absolutely necessary is worth remembering when writing 
MATLAB code. Table 2.4 lists the MATLAB arithmetic operators， where A 


Comments 0 | 


andExamples 


Name ， Function 


_Operator 


AITray and matriX a+b,A+B,ora+A. 
addition 

ArTray and matrix 
Subtractlon 


Array Inultipllication 


Plus(A,，B) 


&a 一 b,A 一 B,A 一 a， 

Or aa 一人. 

C=A.xB,C(I，,J) 
=A(I,J)*B(I,Jd). 

AxB, standard ImatriX 
maltiplication, Of ax*A， 
maultiplication of a scalar 
times all elements of A. 
C=A./B,C(IT，y) 
=A(I,J)/B(I, J)， 
C=A,\B,C(I,J) 

=B(TI, J)/A(I,J). 

A/B is roughly the saimne 3S 
A*inv(B),depending 

on computational accuracy. 
A\VB is TOughly the same 3aS 
inv(A)*B, depending 

on computational accuracy. 
IfCc=A. 8B,then 

C(I, J) = 

A(I, J)^*B(I，J)， 

See onjine help for a 
discussion of thls Operator， 
A,'.Standard vector and 
matrix transpose， 
A'.Standard vector and 
matrix Conjugate transpOSe. 
WhenAi realA. =A'. 
+Als the Same as 0 + A. 
-Ailsthe same as0 一 A 

OFT 一 1*A. 

Discussed in Section 2.8. 


minus(A,，8) 
times(A;, 8) 


Matrix multiplication mtimes(A,，B) 


Aiiray Ilight division ”rdivide(A, 8) 


Array jeft dlivislon Idivide(A,B) 


Matrix right division mrdivide(A, B) 


Matrix left divlslon mldivide(A,B) 


Array POweT Power(A,，8) 


Matrix PowerT mpower(A,，B) 


Vector and matrixX transpose(A) 
transSpOSe 

Vector and matrix 
complex conjugate 
tranSpOSse 

Unary Plus 

Unary minus 


ctranspose(A) 


UpPLUS (A) 
umznus (A) 


Colon 





TABLE 2.4 

Array and matrlx 
arithmetic 
opeTatorsS. 
Computations 
inyolving these 
Operators Can be 
Implemented usling 
the OpPeratorIs 
themselves, as in 
A+B,or using the 
MATLAB 
functions showmn, as 
in plLus (A, 8B),Ihe 
examples Shown 
for arTays USe 
Imatrlices to 
Simplify the 
notation, but they 
are easily 
extendable to 
higher dimensions， 
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【ABLE 2.5 

JIhe Image 
irithrmetic 
Unctions 
supported by IPT 





JDescription 


Funefion 


imadd 
Imsubtract 
ImmuJtIplLy 


Adds two images; or adds a constant to an image. 

Subtracts two images; Or Subtracts a constant fronm an 1mage. 
Mnuitiplies two images, where the multiplication is 

carried out between pairs of corresponding image elements; 
or multiplies a constant times an image. 


Divides two images, where the division is carried out 
between pairs of corresponding image eliements; or divides 
an Image by a constant. 


Compnutes the absolute difference between two images. 
Cormplements an image. See Section 3.2.1. 


Computes a linear combination of two or more images. See 
Section 9.3.1 for an example. 


Imdivide 


Imabsdiff 
Imcomp1lement 
imlincomb 


and B are matrices OF arrays and a and b are scalars. All operands can be real 
Of complex, TIhe dot shown in the array operators is not necessary if the 
Operands are Scalars. 必 eep in mind that images are 2-D arrays，which are 
equlvalent to matrices, So all the operators in the table are apPplicable to 
Images. 

The toolbox sSupports the ;mazage arithmetic functions listed in Table 2.5.Al- 
though these functions could be implemented using MATLAB arithmetic OP- 
erators directly, the advantage of using the IPT functions js that they Support 
the integer data classes whereas the equivalent MATLAB math operators re- 
qulre inputs of class doub1e. 

已 xXample 2.6,to follow uses functions max and min.The former function has 
the Syntax forms 


max(A) 

max(A，B) 
max(A，[ ]，dinm) 
，JTI] = max(..，,) 


中 半 


C 
C 
C 
[C 


In the first form, 这 Ais a vector,max(A) returns its largest element'; Ais a ma- 
trIX, then max(A) treats the columns of A as vectors and returns a Tow vector 
containing the maximum element from each column. In the second form， 
max(A, B) returns an arTay the same size asAand B with the larpgest elements 
taken from A or B. In the third form, max(A，[ ] , dim) returns the largest ele- 
ments along the dimension of Aspecified by scalar dim. For example, max(A， 
[ ]， 1) produces the maximum values along the first dimension (the TOwWS) of 
A.Finally [C, I] = max(.,,.) alsotfinds the indices ofthe maximunm vaiues of 
A, and returns them in output vector I, If there are several identical maximum 
values, the index ofthe first one found is returned.The dots indicate the SyTitaxX 








区 
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used on the right of any of the previous three forms. Function min has the 
same Syntax forms just described. 


盟 Suppose that we want to write an M-function, call it fgprod, that multiplies 
two input Imnages and outputs the product of the images, the maximum and min- 
imum values of the product, and a normalized product image whose values are 
in the range [0,1 .Using the text editor we write the desired function as follows: 


function [p，pPmax，pmin，pn] = improd(f，9g9) 

%IMPROD Computes the product of two images ， 

% [P，PMAX，PMIN，PNj = IMPROD(F，G)t outputs the element-by- 

% element product of two input ;images，F and G，the product 

% maximum and minimum Values，and a normalized product array With 

% Values jn the range [0，1].， The input images must be of the Same 
% Size,， Jhey can be of Class Uint8，unit16，or doublIe， The outputs 
% are of ClLass doub1e， 


fd = double(f) ; 
gd = double(g); 
p = fd.*gdj 


pmax = max(p( 
pmin = minfp( 
pn = mat2gray( 


) ) ; 
) ) 
p) 


Note that the input images were converted to double using the function 
doub1e Instead of im2doub1e because, 这 the inputs were of type uint8， 
im2doub1e would convert them to the range [0, 1]. Presumably we want p to 
contain the product of the original values. To obtain a normalized array, pn, in 
the range [0, I] we used function mat2gray. Note also the use of single-colon 
indexing, as discussed in Section 2.8. 

Suppose that f= [1 2;3 4] andg= [fl 2;2 1].Typing the Preceding 
function at the Prompt results in the following outpnut: 


>> [Pp，pmax，pmin，pn] = improd(f，9g9) 


TIn MATLAB documentationit is customary to use Uppercase characters in the Hl line and in Help text 
when referring to function names and arguments. This is done to avoid confusion between prograrmm 
names/variables and normal explanatory text. 


下 EXAMPLE 2.0: 
Ilustration of 
arlthmetic 
operators and 
functions max and 
中 工 n， 
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下 上 XAMPELE 2.8: 
Loglical operatofrs. 


正 AMPILE 2.9: 
Loglical functions. 


TABLE 2.8 
Logical functions， 


吐 Consider the AND operation on the following numeric arTayS: 


>>A= [120;j045]|; 
>> B= [1 -23;j011]; 
>> A&B 
ans := 
1 ] 0 
] 1 


We see that the AND operator produces a logical array that is of the Same S1Ze 
as the input arrays and has a 1 at locations where both operands are nOnZeIO 
and 0s elsewhere. Note that all operations are done on pairs of corresponding 
elements of the arTays, as before， 

The 0R operator works in a similar manner,. An 0R expression js true L ei- 
ther operand is a logical 1 or nonzero numerical quantity, or 让 they both are 
Ilogical 1s or nonzero numbers; otherwise it is false. The NOT Operator woOIKS 
with a single input.Logically 计 the operand ls true,the NOT operator COnVertS 
it to false.When using NOT with numeric data, any nonZero OPeIand becomies 
0, and any Zero operand becomes 1|. 导 


MATLAB also supports the logical functions Summarized in Table 2.8. [he 
all and any functions are particularly useful in programming. 





留 Considerthe simple araysA = [123;456]andB= [0-11i 002]. 
Substituting these arrays into the functions in Table 2.8 yield the folowing resujts: 


>> Xor(A，B) 


ans 三 


Function Comnmients 


The xor function returns a 1 only 让 both operands are 
jlogically different; otherwise xor Teturns a 0. 
a11 The all function returns al 下 allthe elements in a 


xor (exciuslve OR) 


vector are nonzerol otherwise all returns a 0. This 
function operates columnwise on matrices， 

The any function returns a 1 让 any of the eljements in a 
vector js nonzerol; otherwise any returns 3a 0. Thils 
function operates columnwise on matrices， 
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>> 31(A) 
ans = 

1 ] 1 
>> any(A) 
ans = 

1 1 1 
>> al (BI) 
ans = 

0 0 1 
>> any(B) 
ans = 

0 1 ] 


Note how functions all and any operate on columns of A and B. For instance， 
the first two elements of the vector produced by alLllL(B) are 0 because each 
of the first two columns of B contains at least one 0; the lastielement 1S 1 be- 
cause all elements in the last column of B are DOnZerTo. 册 


In addition to the functions listed in Table 2.8, MATLAB provides a 
number of other functions that test for the existence of Specific conditions 
or values and return logical resujlts. Some of these functions are listed 1 
Table 2.9,A few of them deal with terms and concepts discussed earlier in 
this chapter (for example, see function islogical in Section 2.6.2); others 
are Used in Subsequent discussions,Keep in mind that the functions listed in 
Table 2.9 return a logical 1 when the condition being tested is true; other- 
wise they return a logical 0. When the argument 18 an array, Some of the 
functions in Table 2.9 yield an arTray the Same Size as the argument contain- 
ing logical 1s in the locations that Satisfy the test performed by the function， 
and logical 0s elsewhere. For example, ifA= [1 2;3 1/0],the function 
isfinite(A) returns the matrix [1 1; 1 0],where the0 (false) entry indi- 
cates that the last element of A is not finilte. 


Some Important Variables and Constants 


The entries in Table 2.10 are used extensively iD MATLAB programming. For 
example, eps typicaliy is added to denominators in expresslons to PIevent 
overflow in the event that a denominator becomes zerT0. 
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“ABLE 2.9 
IOme functions 
hat return a 
:DOglcal 1 or a 
"3gical0 
lepending on 
7hether the vajlue 
.Tcondition 1n 
“heir arguments 
Te trUe OT 
alse.See online 
.elp fora 
“omplete list. 


ABLE 2.10 
Some ImPortant 
'arlables and 
0nstants， 


Funection 
isScell(cC) 
isScellLstr(s) 
zsSchar(s) 
isSempty(A) 


1isequal(A，B) 
isfield(S，' namne ) 


Isfinite(A) 
1ISinf(Al) 
1IslLetter(A) 
Is10gical(A) 


IsSmember(A，8) 


isnan(A) 


IsSnumeric(A) 


ispPime(A) 
isreal(A) 
SSpacefA) 


1SSparse(A) 
1ISStruct(S) 


Eunction 


if(or j) 
NaN Or nan 
PI 
Fealmax 
Peamzin 


Comnputenr 
Version 


Description 


True ECls acell array. 

True if sis acell array of strings. 

True lf sls a character StTing. 

True 过 Ais the empty array, 人 |. 

TrueifAandBhave identical elements and dimensions. 
True 让 :name' is afield of structure S. 

True in the locations of array A that are finite. 

True in the locations of array Athat are infinite. 

True in the locations of A that are letters of the alphabet， 
True 下 Ais aiogical array. 

True in locations where elements of A are also in B. 


True in the locations of A that are NaNs (See Jable 2.10 for 
a definition of NaN)， 


True 直 LAlils anurmeric arTay. 
True in locations of A that are Prime numbers. 
True ithe eliements of Ahave no imaginary parts. 


True at locations where the elements of A are whitespace 
characterTs. 


True 过 Alis a sparse IatTixX. 
True ifSis astructure， 





Se Value Returned 
Most recent answer (variable), Ifno output variable ls assigned to 
an expression, MATLAB automatically stores the result in ans， 


Floating-boint relative accuracy TIhls ls the distance between 1.0 and 
the next largest number representable Using double-precislon 
floating point. 


Jmaginary unit, as in 1 + 2i. 


Stands for Not-a-Number (e.g., 010)， 
3.141392033S8979 
Tbhe largest floating-point number that youUIT comPuter can TepITesent. 


The smallest floating-point number that your compntter can 
TepPTresent， 


Your computer type. 
MATLAB version String. 





1 
Tc 
和 鸭 


4 治 


:有 


2 
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Number 人 Representation 


MATLAB uses conventional decimal notation, with an optional decimal point 
and leading plus or minus sign, for Dumbers. Scientific notation uses the letter 
e to specify a power-of-ten scajle factor, Imaginary numbers use either ji or j as 
asuffix.some exampjles of valid number Tepresentations are 


3 -99 0.0001 
9.6397238 ”1.60210e-20 “6.02252e23 
1 -3.14159j 3e51i 


Al numbers are stored internally using the long format specified by the Insti- 
tute of Electrical and Electronics Engineers (IEEE) floating-point standard. 
Floating-point DUmbers have a finite precision of roughly 16 significant deci- 
mal digits and a finite range of approximately 10-308 to 10+308, 


2.10.3 Flow Control . 


The abljity to control the flow of operations based on a set of predefined con- 
ditions is at the heart of all Programming languages. In fact，conditional 
branching was one of two key developments that led to the formulation of 
generalrpurpose compnuters in the 1940s (the other development was the use 
of memory to hold stored programs and data), MATLAB piIovides the elght 


How control statements summarized in Table 2.11.Keep in mind the observa- 


ton made in the previous section that MATLAB treats a loglcal 1 or nonzero 
number as true, and a logical or ntmeric 0 as false. 


3 Deseription 


-Statement 
if if, together with else and elseif,executes a group of 
statements based on a specified logical condition 
for Executes a group of statements a fixed (specified) number of 
times. 
While Executes a group of statements an indefinite number of times， 
based on a specified jogical condition. 


break 
Continuae 


疙 rminates execution of a for or while loop. 
Passes control to the next iteration of a for or while loop， 
skipping any remaining statements in the body of the loop. 


Switch, together with case and otherwise,exectites different 
groups of statements, depending on a Specified value or 
String. 


SWitch 


PetuUcn 
try 本 二 目 Catch 


Causes exectltion to return to the invoking function. 
Changes flow control if an error is detected during execution. 


TABLE 2.11 
上 low control 
statenments. 
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也 和 AMPLE 2.10: 


Condltionaj 
branching and 
introductlon of 
functions error， 
length, and 
nume1. 





If, else, and el1Sel 了 f 
Conditional statement if has the SyDntaxX 
if exXpreSSizohn 
StatementsS 
end 


The expressionis evaluated and, 让 the evaluation yields true,MATILAB ex- 


ecutes one or more commands denoted here as statements, between the if 是 


and end jines. If expressionis false,MATLAB skips all the statements be- 
tweenthe if and end lines and resumes execution atthe line following the end 
line. When nesting ifs,each if must be Paired with a matching end. 


The else and elsejif statements further conditionajize the if statement， 


The general Syntax 1S 


If expreSSIOnT 
StatementsSi 
elSeif expreSSIoOm2 
StatementS2 

el1Se 
StatementSa 
end 


If expressiony is true, statementsy are executed and control 1s transferred - 
to the end statement. If expressiony evaluates to false, then expreSssion2 
is evaluated. If this expression evaluates to true, then StatementS< aIe EXE- 
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% Compute the average 
av = SuUm(A(:))/lLength(A(:)) 


Note that the input is conyerted to 3a 1-D array by Using A(:). In generTal， 
length(A) returns the Size of the longest dimension of an array, A. In thls exX- 
ample， because A(:) ls avector,Length(A) pglves the number of elements of A. 
This eliminates the need to test whether the input ls a vector of a 2-D arTay. 
Another way to obtain the number of elements in an array directly 1s to use 
function nume1, whose SyntaxX 1S 


n = numel(A) 





Thus， ifAis an image,numel(A) gives its number of Pixels. Using thls function， 
the last executable jine of the previous PIOgIarma becormes 


1 sum(A(:))VnumeLI(A) ; 


Binally note that the error function terminates execution of the progranm and 


outputs the message contained within the Parentheses (the quotes Shown 3T6 


Tequired). 器 


for 
Asindicated in Jable 2.11,a for ]joop executes a grOUP of statements 3 SPecl- 
fednumber oft times. TIhe Syntax 1S 


cuted and control is transferred to the end statement. Otherwise (else) 渤 


statements3 are executed. Note that the else statement has no condition， 


The else and elseif statements can appear by themselves after an 1f state- 了 
ment; they do not need to appear in pairs, as shown in the preceding general 汝 - 


syntax. ft is acceptable to have multiple elSseif Statements. 


Suppose that we want to write a function that computes the average Inten- 站 
sity of an image. As discussed earlier, a two-dimensional array f can be com- 了 
verted to a columan vector, v, by letting v = f(:),， Therefore，we want OUT 玫 
function to be able to work with both vector and image inputs. The program 浊 


should produce an error ifthe input is not a one- or two-dimenslonal arTay. 


function av = average(A) 

%AVERAGE Computes the average Value of an array. 

% ， AV = AVERAGE(A) computes the average value of input 
% array，A，which must be a 1-D or 2-D array， 


% Check the validity of the input， (Keep in mind that 
% 8a 1-D array is a Special case of a 2-D array , ) 
if ndims(A) > 2 
error('The dimensions of the input cannot exceed 2，) 
end 


end 








for k = 


加 
n 站 
日 
区 end 
: 


1 for Inoex = Start: Increment: end 


StatementS 


it 1S possible to nest two or more for loops, as follows: 


for Inaexy = StartyT:zincrementTyT end 


StatementsS1 

for Inaex2 = Start2:7ncre1ent2:6mnd 
StatementrS2 

end 


additional 100D1 StatementS 


end 


For example, the folowing loop executes 1l times: 


0 
0:0.1:1 
count = Count + 1 


count = 
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正义 AMPLE 2.11: 
Using afor loop 
to write multiple 


.images to file. 


众 
光 
和 
加 


光 SN 
A 2 AN 


局 SA 
1 
2 
人 
| 


9ee ipe pnelp page 加 r 
Sprintf /fororper 

SyPiax for appiic- 
Gapieto 纺 1 应 mactiom， 


过 the loop increment is omitted, it is taken to be 1.Loop increments also can 
be negative, as in k = 0:~1:-10.Note that no semicolon is necessary at the end 
of afor lne.MAILAB automatically suppresses printing the values of a loop 


index. As discussed in detail in Section 2.10.4, considerable gains in program ， 
exXxecution Speed can be achieved by replacing for loops with so-called : 


Vectorized code whenever possible， 


狂 Example 2.2 compared Several images using different JPEG quality val- 
ues. Here, we show how to write those files to disk using a for loop. Suppose 
that we have an image, f,and we want to write it to a series of JPEG files with 


quallty factors ranging from 0 to 100 in increments of $. Further, suppose that 1 汪 
we want to write the JPEG files with filenames of the form series_xxx,jpg， 肖 : 


where xxx is the quality factor. We can accomplish this using the following 
for 1oop: 


forgq=0:5:100 
filename = Sprintf('series %3d.jpg' ，q); 
imwrite(TfT，filename，'quality'，gq); 

end 

Function Sprintf, whose syntax in this case is 


s = Sprintf( characters1sndcharacters2' ，dq) 


writes formatted data as a string，s,. In this syntax form, characters1 and 清 


characters2 are character Strings, and %nd denotes a decimal number (speci- 


tied by 9) with n digits. In this example, characterst is series_ ,thevalue of 得 
n 1$ 3, characters2 is .jpg,and qhas the values specified in the loop. 是 “下 


While 


A while loop executes a group of statements for as long as the expression 
controlling the loop is true.The syntax is 


While expPressionm 
StatementS 
end 


As in the case of for,while loops can be nested: 


While expressiom1 
StatemenmtsS1 
While exporesszon2 
StatementS2 
end 
acoztzIzonal Jo00D01 StatementS 
end 











For exampjle, the following nested while loops terminate when both a and 
b have been reduced to 0: 


a= 10; 
b = 5; 


。 While a 


a=a 一 1]; 
While b 

b=b 一 1 
end 


end 


本 Note that to control the ljoops we used MATILAB'S convention of treating 3 
numerical value in a jogical context as true when it is nonzero and as false 

.whenitis 0.In other words,while aand while b evaluate to true as long as a 
and b are nonzero. 


As in the case of for loops, considerable gains in Program execution speed 


can be achieved by replacing while loops with vectorized code (Section 


2.10.4) whenever poSsSible. 


break 
As its name Implies, break terminates the execution of a for or while joop. 
_ When a break Statement is encountered, execution continues with the next 
statement outside the loop. In nested loops, break exits only from the inner- 
”Imnost loop that contains it， 


“Continue 
The continue statement passes control to the next iteration of the for of 
while ljoop in which it appears, skipping any remaining statements in the body 
of the loop. In nested loops, continue passes control to the next iteration of 
the loop enclosing 让 ， 


Switch 


This is the statement of choice for controlling the flow of an M-function based 
on different types of inputs.The syntax js 


_ SWitch Switch expression 


caSe Case_expreSSion 
StatemenmtfsS) 

ase {case_expreSssion1，case_6XpressSion2，,..} 
StatemenmtfsS) 

otherwise 


StatementfS) 
end 
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卫 XAMPLE 2.12: 
季 xtracting a 
Subimage from 8 
given Image， 


The switch construct eXecutes groups of statements based on the value of a 


Varlable oOT expresslon. TIhe keywords case and otherwise delineate the 和 
groups. Only the first matching case is executed.f There must always be an 演 
end to match the Switch statement. The curly braces are used when multipie 潭 


exXPIesSions are Included in the Same case statement. As a Simpje exampPple， 
Suppose that we have an M-function that accepts an Image f and converts lt to 
a Specified class, call it newc1lass,. Only three image classes are acceptable for 
the converslon: Uint8,uUint16,and double.TIhe foljlowing code fragment PerT- 
forms the desired converslon and outputs an eITOT if the class of the input 
Image Is not one of the acceptable classes: 


SWi+tch newclIasSs 


caSe ULnt8 

g = im2uint8(f) ; 
Case ULInt16， 

9 = Im2ulnt16(f) ; 
caSe “'doub1e- 

g = im2double(f) ; 
otherwise 


error( Unknown or improper liImage CLasSs，) 
end 


The switch construct 1Ss used extensively throughonut the book. 


组 In this example we write an M-function (based on for loops) to extract a 澡 


rectangujar Sublimage from an image.Although, as shown in the next section， 
we could do the extraction using a Single MATILAB statement, we use the Pre- 
Sent example later to compare tbhe Speed between loops and vectoIlized code. 
The inputs to the function are an image, the Size (Dumber of rows and 
columans) of the subimage we want to extract, and the coordinates of the top， 
left corner of the Subimage. 上 eep in mind that the image origin in MATITLAB 
ls at (1, 1),as discussed in Section 2.1.1， 


function S = SUubim(fT，m， nn，rx， cy) 

%5SUBIM Extracts aa Subimage，S，Tfrom a given Image， 了 ff， 
%5 The Subimage is of Size m-by-n，and the coordinates 
%5 of its top，1lLeft corner are (PrxX，Ccy) . 


S = Zeros(m，n) 

FoOwhigh = PX + 由 一 二) 

CoLlhigh = CYyY +n 一 1T; 

XCcount = 0; 

for PP = Pxirowhign 
XCount = XCcount + 1 
YCoOunt DO; 


Unlike the C Ianguage Switch construct, MATLAB'S switch does not “fall through.”That is, Switch 


exXecutes only the first matching case; Subseqduent matching cases do not execute. Therefore, break State- 
ments are not used， 








和 ， 
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for Cc = Cy:colhigh 
YCount = YCOunt + 1 
S(XCount，Yycount) = fT(P，C) 
end 
end 


小 


In the following section we give a signijficantly more efficient implementation 
of this code. As an exercise, the Teader Should implement the preceding Pro- 
gram using while lnstead of for loops. 尘 


2.10.4 Code Optimization 


As discussed in some detail in Section 1.3, MAILAB is a programming jan- 
guage Specifically designed for array operations. Iaking advantage of thls fact 
whenever possible can resujt in significant increases in computational Speed. 


JInthis section we discuss two important apProaches tfor MATLAB code opti- 
Inization: vectorizirg loops and prealiocating arTayS. 


Vectorizing Loops 


Yectorizing Simply means converting for and while loops to equlvalent Vec- 
tor or matrix operations. As will become evident Shortiy, vectorlzation Can Te- 
sult not only in Significant gains in computational Speed, but lt also helps 
improve code readabijlity.Although multidimensional vectorization can be dif- 
ficult to formulate at times, the forms of vectorization used in lmage PITOCeSS- 


_ing generally are Stralghttforward. 


We begin with a Simple example. Suppose that we want to generate 3a 1-D 


function of the form 


F(x) = 4Ssin(xz/27) 


forxy = 0,1,2,,..,AM -1.Aforlooptoimnplementthis computation 1S 


Array indices In MATLAB cannot be 0， 


for X= 1:MN 拖 
= AxSin((xX 一 1)/(2*pi)y) 


f(X) 
end 


However this code can be made considerably more efficient by vectorizing it; 
that is, by taking advantage of MATLAB indexing, as follows: 


XxX=0O:M 于 
f = ArxSsin(X/A(2*pi)); 


As this Simple example illustrates，1-D indexing generally is a Simple 
prIocess. When the functions to be evaluated have two variables, optimized 
indexing is slightly more subtle.MATLAB provides a direct way to implement 
2D function evaluations via function meshgrid, which has the syntax 


[C，R] = meshgrid(c，) 


、 | eshgrid 
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This function transforms the domain specified by rowm vectors c and Fr into ar- 
raySs Cand Rthat can be used for the evaluation of functions of two variabjles 


and 3-D Surface plots (note that columns are listed first in both the input and 半 


output of meshgrid). 


The rows of outpnut array C are copies of the vector c, and the columns of 滞 


the outpnut array R are copies of the vector r. For example, suppose that we 
want to form a 2-D function whose elements are the sum of the squares of the 


vajues of coordinate varlables x andyftorx=0，1，2andy=0，1.The vec- 是 
torr ls formed from the row components of the coordinates:r= [0 1 2].Sim- 滞 
arly, c ls formed from the column component of the coordinates:c= [0 1] :如 


(keep in mind that both r and c are iow vectors here)j. Substituting these two 
vectors Into meshgrid results in the following arrays: 


>> [4，R]= meshgridf(c，F) 


C = 
0 
0 1 
0 1 

R 三 
0 0 
1 1 
2 


The function in which we are interested is implemented as 
2 
which gives the following result: 


用 = 过 


人 
Or fn 


Note that the dimensions of h are length(r) X Length(c).Also note, for ex- 
ample, that h(1,1) =R(1,1)^2+C(11)^2.Thus MATLAB automatically 
took care of indexing h.This is a potential source for confusion when 0s are in- 
volved in the coordinates because of the repeated warnings in this book and in 
manuals that MATLAB arrays cannot have 0 indices. As this simple illustra- 
tion Shows, when forming h, MATLAB used the conienis of R and C for com- 
putations. Ihe indicesy of h, R, and C, started at 1. The power of thls Indexing 
Scheme is demonstrated in the following example. 


3 人 


5 
人 
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时 In this example we write an M-function to compare the implementation of 
the following two-dimensional image function using for loops and vectorization: 


太 t 功 一 4sin(xoxr + Woy) 


forxz = 02 一 Landy=012...,N 一 1Wealsointroduce the 
timing fanctlons tic and toC. 

The function inputs are 4,zo, mW, M and N.The desired outputs are the im- 
ages generated by both methods (they shoujd be identical), and the ratio of 
the time lt takes to Implement the function with for loops to the time it takes 
to implement it using vectorization. The solution js as follows: 


function [Frt，T，g] = twodsin(A，U0，v0，M，N) 

%TWODSIN Compares for 100pSs vsSs、vectorization， 

% The Comparlison is based on implementing the function 

% fx，y) = Asin(u0x + VOy) for X = 0，1，2,...,，M-1and 
% yY=0 1 2:...，N-1.， The inputs to the function are 

% MandNandthe constants in the function， 


% First implement Using for 100ps ， 
s Start timing ， 


for FF = 1:M 
UOX = UOx*( 人 一 1)); 
for C = 13:N 
VOyY = VOx*(c 一 1); 
flr，C) = AxSin(UOXx + vVOYy) ; 
end 
end 


下 寺 可 E 


tiC 


各 End timing， 
名 NOW implement Using vectorization.， Call the Image g， 
tic  s% Start timing . 


0:J 一 1; 
0O:N 一 1; 
[C，R] = meshgrid(c，F)， 
9 = A*Sin(uUO*R + VOx*C) ; 

2 二 DO: 


s Compute the ratio of the two times， 


居 


全 机 


各 End timing. 


rt =ti/(t2+ eps);%Use eps in case t2 is close to 0. 


Running this function at the MATLAB prompt 


”> [rt，f，g] = twodsin(1，1/(4*pi)，1/(4*pi)，512，512); 


上 上 和 AMPLE 2.13: 
Anllustratton of 
the computational 
advantages of 
vectorization, and 
intruduction of 
the timing 
functions tic and 
七 OC. 
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FIGURE 2.7 
Sinusolidal 1mage 
generated In 


Example 2.13. 





ylelded the following value of rt: 


>> 下 十 
二 
34.2520 


We convertthe image generated (f and g are identical) to viewable form Usling 


function mat2gray: 

>> g = mat2gray(g) j 

and display it using imshow， 

>> ImSshow1(g) 

Figure 2.7 Shows the resujt. 测 


The vectorized code in Example 2.13 runs on the order of 30 tmes fasteT 
than the impliementation based on for loops. JIhis is a significant comPputation- 
al advantage that becomes increasingly meaningful as relative execution times 
become longer. For example, 让 MandN are jarge and the vectorlzed Program 
takes 2 minnutes to ITrun, it woujd take over 1 hour to accompjlsh the same task 
Using for ljoops.Numbers like these make it worthwhile to vectorize as much of 
a Program as possible, especially 这 routine use of the program in envisloned. 

The preceding discussion on vectorization is focused on computations in- 
volving the coordinates of an image. Often, we are interested in extracting and 
processing regions of an image. Vectorization of Programs for extracting such 
regions is particularly simple 这 the region to be extracted is rectangujar and 
encompasses al pixels within the rectangle, which generally is the case in this 
type of operation. The basic vectorized code to extract a Tegion, s,of SiZe mxn 
and with its top left corner at coordinates (Frx，cy) is as follows: 


rowhigh = PX 十 站 一 1; 
colhighn = cy+nhn 一 1; 











.时 
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s = f(rx:rowhigh，cy:colhigh) ; 


where fis the image from which the region is to be extracted. The for ljoops to 
accomplish the same thing were already worked out in Example 2.12. IJmple- 
menting both methods and timing them as in Example 2.13 would show that 
the vectorized code runs on the order of 1000 times faster in tbls case than the 
code based on for ljoops. 


Preallocating Arrays 


Another Simple way to improve code execution time is to Preallocate the Slze 
of the arrays used in a PITogram. When woiking with numeric oT 1jogica] arrays， 
preaiiocation simply consists of creating arITays of Us with the PITOPer dimen- 
sion，For example, 诺 we are worKking with two images, f and g，of Size 
1024 X 1024 pixels, preallocation conslsts of the Statements 


>> = zeros(1024); gg = zeros(1024) | 


Preallocation also helps reduce memory fragmentation when working with 
large arrays. MemoIy can become fragmented due to dynamic memory ajjoca- 
tion and deallocation.The net result is that there may be sufficient physical merm- 
ory available during compnutation, but not enough contiguous memory to hoid a 
large variable. Preallocation helps prevent this by alowing MAILAB to reserve 
sufficient memory for large data constructs at the beginning of a computation， 


.0.5 Interactive IO 


Often, it is desired to write interactive M-functions that display information 
and instructions to users and accept inputs from the keyboard. In this sectlon 
we estabjlish a foundation for writing Such functions. 

Function disp is used to display information on the Screen. Its Syntax is 


disp(argument ) 


if argument is an array, disp displays its contents. If argument is a text String， 
then disp displays the characters in the String. For example， 


>>A= ff2; 34]; 


>> displ(A) 
1 2 
3 4 
>> SC = 'Digital Image ProcesSSsing ，; 


>> disp(Sc) 

Digital Image Processing. 

>> disp('This is another way to displLay text，) 
This is another Way to disp1Lay text， 


ee 4PDPendr 总 帮 r 
delails om COHSITLC- 
8 SGQPDAECGE IEr 
iterfaces (CD 





60 “Chapter 2 路 Fundamentals 








< 没 2 六 


2 





ee Section 72.4 1or 
2 delaiied disclssior 
Oj srring OperaiioHnz 


Note that only the contents of argument are displayed, without words like 
ans =, which we are accustomed to seeing on the Screen whben the value of a 
variable ls displayed by omaitting a semicoion at the end of a command line. 

.上 unction input ls used for inputting data into an M-function. The basic 


SYntaX 1S 


t = Input( ' message ') 


Tbls function outputs the words contained in message and waits for an inpnut 


from the user followed by a return, and stores the input in t.The input can be 
a Single number, a character string (enclosed by single quotes), a vector (en- 
closed by Square brackets and elements separated by spaces or commas), a 
matrix (enclosed by square brackets and IOwS separated by semicolonsj, or 
any other valld MATLAB data structure.The syntax 


t = input( message'，'S') 


outpnuts the contents of message and accepts a cparacter String whose ele- 
ments can be Separated by commas or Spaces. This Syntax is fiexible because 让 
allows mnaultiple individual inputs. If the entries are intended to be numbers,the 
elements of the string (which are treated as characters) can be converted to 
nurmbers of class double by using the function str2num, which has the syntax 


n = Str2numtt) 
For example， 


>> 七 = Input( Enter your data: '，'S') 
Enter Your data: +1，2，4 
芷 :三 
124 
>> ClLass(t) 
ans = 
char 
>> Sizet(t) 


号 = 
1 5 


>>n= Str2numt(t) 
门 ” 三 











>> Size(n) 
ans = 

1 3 
>> C1ass(n) 


ans = 
double 


Thus, we see that tl8a1 xx 5character array (the three numbers and the two 
spaces) andnigsal X 3vector of numbers of class doub1e. 

Itthe entries are a mixture of characters and numbers, then we Use one of 
MATLAB s string processing functions. Of particular interest in the present 
discusslon ls function strread, which has the syntax 
了 ormat ， param' ，'valLue ') 


[DSS .] = Strread(cSstr， 


This function reads data from the character string cstr, using a specified 


format and param/value combinations. In this chapter the formats of interest 


are %f and %q,to denote foating-point numbers and character strings, respec- 


tively. For param we use delimiter to denote that the entities identified in 


format wbe dejimited by a character specified in value (typically a comma 
-orspace), For example,suppose that we have the string 


>t= '12.6，x2y，z'，; 


_To read the elements of thlis input into three variables a，b，and c, we Write 


aa= 


>> [a，b，c] = Strread( 廿 ，'%f5%q%gq ' ， delimiter " ， "，') 
12.6000 
X2Y， 

C 二 
1 


Output ais of class double:the quotes around outputs x2y and z indicate that 


and care cell arrays, which are discussed in the next section, We convert 


them to character arTrays SImply by letting 


.>>d= char(b) 


X2Y 
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and similarly for c. The number (and order) of elements in the format String 
must match the number and type of expected output variables on the left. In 
this case we expect three inputs: one floating-point number folljowed by two 





contains three elements: a character string,a2 X2matrix,and ascalar (note 
the use of curly braces to enclose the arfaySs). Jo select the contents of a cej] 
arTay We enclose an integer address in curly braces. In thls case, we obtain the 
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character Strings. 

Function strcmp is used to compare strings. For example, SUppose that we 
have an M-function g= imnorm(f，param) that accepts an Image,f,andapa- 渤 >> Cf1} 
rameter param than can have one of two forms: norml ,and 'norm255 . In 证 ans = 
the first instance, f is to be scaled to the range [0, 1]; in the second,itis to be 时 


following results: 





FPCHOP Strcmp 


{S1， S2 ) COP1Pare3 gauss 
no str s1a0d scaledtotherange [0,255].The output should be of class double in both cases. 
S2, ad retaraya >> C{2} 
logical true (1T) 太 The following code fragment accormpjllshes the required normallzatlon: 
Pe S1ri8S Gre EGLGD ans 一 
1ermise 下 re C 0 
logical false (0). f = double(Tf) 0 ] 
f= 和 牛 一 mnffT(:)) 
f = f./max(f(:) ); 
if strcmp(param，':normt ' ) ”ans 一 
上 运 - 于 3 
elSseif Strcmp(param， ' norm255  ) . . . 
g = 255xf， An important property of cell arrays ls that they contain copies of the argu- 
ER ments, not pointers to the arguments. For example, 过 we were WOIKing with 


error( 'Unknown value of param. ) ceij array 


end 
c = {A，8B} 





An error would occur if the value specified in param is not "norml OF 
'norm255'.Also, an error would be issued it other than all 1owercase charTac- 
ters are used for either normajlization factor. We can modify the function to ac- 
cept either lowerT Or Uppercase characters by converting any linput to 
lowercase Using function Lo0wer, as fojlows: 


轨 ” ，iwhichAandBare matrices and these matrices changed sometime later in a 
二 program,the contents of c would not change. 
trzcttresy are Simjlar to cell arrays, in the sense that they aljow groupling of a 
collection of dissimjlar data into a single variable. 再 owever unlike cell arrays 
.whbere cejls are addressed by numbers, the elements of structures are addressed 
by names called jejas,.Depending on the application,using fieids adds clarity and 
Teadability to an M-function. For instance,jletting S denote the Structure variable 
and using the (arbitrary) field names char_string, matrix, and Scalar, the 
data in the preceding example could be organijzed as a Structure by letting 


param = LIower(param) 





Similarly, if the code uses Uppercase letters, we can convert any InpUut character 什 
string to UppPercase Using function Upper: 进 ， 
3S,char_string = 'gauss ' | 


建 S， jiX = ， 
param = upper(param) Smatrix = [1 0 0 1 





下 S$.scalar = 3 
。 辣 Note the use of a dot to append the various fields to the structure variable， 
2.14.6 A Brief Introduction to Cell Arrays and Structures 二 Ilhen,for example,typing S,matrix at the Prompt, would produce 
When dealing with mixed variables (e.g., characters and numbers), we can 昌 
Cell arrays and make use of cell arrays.A ce array in MATLAB is amultidimensional array 浊 >> S.matrixX 
S111CHUres QT7e dd- whose elements are copies of other arrays. For example,the cejl array 间 ans = 


CSSe0d LT detai 避 
ectior 7 了 77 


浊 1 0 
0 1 


CC=({fgauss' ，[I10;i)01]，3} 


4 
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whbich agrees with the corresponding output for cell arrays.The clarity of UsSling 


5:matriIx as OpPposed to cf2} ls evident in this case. This type of readability 
can be Important 过 a function has numerous outputs that must be interpreted 
by a User. 


9U1111T1Q1TV 


The material in this chapter is the foundation for the diseussions that follow. At this 
point, the reader should be able to retrieve an image 条 om disk, process it via simple 
manipujations, dispiay the result, and save it to disk. It is important to note that the key 
jesson from this chapter is how to combine MATLAB and IPT functions with pro- 
gramming constructs to generate Solutions that expand the capabilities of those func- 
tons. In fact,this is the model of how material is presented in the following chapters. By 
combining standard functions with new code, we show prototypic solutions to a broad 
Spectrum of problems of interest in digital image processing， 






”717eviez0 
Theterm sparial domaain refers to the Image Plane itself and methods in this cat- 
”egory are based on direct manipulation of Pixels in an image. In this chapter we 
focus attention on two important categories of spatial domain Processing: 
atensipy (or 8ray-leven) trans1praatiors and Spatial jiliering. The latter approach 
“SOmetimes is referred to as jeigPporpood Process1118, OT SPDatial comvolutiomn. In 
the following sections we develop and ilustrate MATLAB formulations repre- 
Sentative of processing techniques in these two categories. In order to carry a 
consistent theme, most of the exampies in this chapter are related to image en- 
hancement. This is a good way to introduce Spatial processing because enhance- 
_ inent ls highly intuitive and appealing,especially to beginners in the field.As will 
_be seen throughout the book,however,these techniques are general in scope and 





have uses in numerous other branches of digital image Processing. 





Background 


As noted in the preceding paragraph, spatial domain techniques operate di- 
Iectly on the pixels of an image.The spatial domain PITOcesses dlscussed in thils 
chapter are denoted by the expression 


8 yj) = 了 7TLA(c y)] 


where F(x, y) is the input image， 8g8(x, )) is the output (processed) image, and 
/ is an operator on Pdefined over a speclfied neighborhood about point 
5 y). In addition ,Tcan operate on a set of Images, Such as performing the ad- 
dition of K images for noise reduction， 

_Jhe principal approach for defining spatiai neighborhoods about a point 
(xc yj)istouseasquare or rectangular region centered at (x, y), as Fig. 3.1 shows 
The center of the reglon is moved from Pixel to pixel starting, say, at the top, Jeft 
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FIGURE 3.1 入 


nelghborhood of 
Size3 X 3abouta 
point (xy) in an 


Image. 


二 adjUst 


Origln 2 


Image 太 xy 





飞 


.mer and, as it Imoves, it encompasses different neighborhoods. Operator Tis . 


appliea .， ?ach locatlon (x, y) to yield the output, 8g, at that location. Only the 
pixels in the nciszerhood are used in computing the value of g at (xy). 

The remainder of this -PPnter deals with various implementations of the 
preceding equation.Although this vsaabonls simple conceptualy', its compu- 
tational Implementation in MATLAB reduires that careful attention be Paid 
to data classes and value ranges，. 





LE 和 Intensity Transformation Functions 


The simpjest torm of the transformation 7 is when the neighborhood in 
FIg.3.llsofsizel xl(asingle pixel). In this case, the value of g at (x, y) de- 
pends only on the intensity of 太 at that point, and 7 becomes an ztersity oT 
Sray-ievel transformation function. These two terms are used interchangeably， 
when dealing with monochrome (ij.e., gray-scalje) images. When dealing with 
color Images, the term iatersity is used to denote a color image component in 
certaln coOlor Spaces, as described in Chapter 6， 

Because they depend onjly on intensity values, and not explicitly on (xz，y)， 
Intenslty transformation functions frequentiy are written in simplified form as 


3 一 了 (7 
where 六 denotes the intensity of 六 and sy the intensity of g, both at any corTe- 
sponding point (*, y) in the images. 
3.2. Function imadjust 


Function imadjust is the basic IPT tool for intensity transformations of gray- 
Scale Imiages. It has the Syntax 


g= inadjust(f，[1low in high_in]，[FLow_ out high_out]，gamma) 


As jllustrated in Fig. 3.2.this function maps the intensity values in image ff 
to new values in g, Such that values between Low_in and high_in map to 





>> g1 = imadjust(f， 


吨 -o0 gamma < ] gamma = 1 gamrIma > 上 


high_in low_in high_in low_in high_in 


walues between low_out and high_out, Values below low_in and above 
.high_in are clipped;that is, values below low_in map to 1ow_out, and those 
:above high_in map to high_out. The input image can be of class Uint8， 
uint16, or double, and the output image has the same class as the input. All 
inputs to function imadjust, other than f, are specified as values between 0 
andl,regardless of the class of f.Iffis of class uint8, imadjust multiplies 
the values supplied by 255$ to determine the actual values to use; 计 fis of class 
uint16,the values are multiplied by 65$35. Using the empty matrix ([ ]) for 
[low_in high_in] or for [low_out high_out] results in the default values 
fo 1].Hhigh_outis less than 1ow_out,the output intensity js reversed. 


Parameter gamma specifies the shape of the curve that maps the intenslty 


walues in f to create g. If gamma js iess than 1,the mapping is weighted toward 
” _ ”higher (brighter) output values, as Fig. 3.2(a) shows. 二 gamma is greater tbhan 1， 

-the mapping is weighted toward Ilower (darker) output values. If it 1s omiltted 
” from the function argument, gamma defaults to 1 (linear mapping). 


和 葡 Figure 3.3(aj is a digital mammosgram image,f, showing a small leslon, and 
Rig.3.3(b) is the negative Image, obtained using the command 


[0 1]，[1 9) 


This process, which is the digital equivalent of obtaining a photographlc nega- 


tive, is particularly useful for enhancing white or gray detail embedded in a 
large, predominantjy dark region. Note, for example, how much easier lt 1S to 
analyze the breast tissue in Fig. 3.3(b). The negative of an Image can be ob- 
tained also with IPT function imcomp1lement: 


g = incomplLemnent (Tf ) 
Figure 3.3(c) is the result of using the command 
[0 1) 


>> g2 = imadjust(f，[0.5 0.75]， 


which expands the gray Scale region between 0.5$ and 0.75 to the full [0, ] 
Tange. This type of processing is useful for highlighting an intensity band of 
interest. Finally using the command 


>> g939 = imadjust(f，[ ]，[ ]，2); 
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FIGURE 3.3 (ay) 
Original digital 
mammogram. 
(b) Negative 
Image.(c) Result 
of expanding the 
Intensity range 
10.5, 0.75]. 

(d) Result of 
enhancing the 
image with 
gamma = 2. 
(Original image 
SoOUrtesy of G. 下 . 
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One of the Princlipal uses of the log transformation is to compress dynamic 
Tange， For exampjle, lt ls not Unusual to have a Fourier Spectrum (Chapter 4) 
with values in the range [0, 104] or higher. When displayed on a monitor that is 
.scaled linearly to 8 bits, the high values dominate the display, restilting in lost 
:Visual detail for the lower intensity values in the spectrum. By computing the 
10g， a dynamic Tange on the order of for example, 105, is reduced to approxi- 
mately 14, which is much more manageable. 
When performing a logarithmic transformation, it is often desirable to 
bring the resulting compressed values back to the full range of the display For 
8bits the easiest way to do this in MATLAB is with the statement 


， >> gs = im2uint8(mat2gray(g) ) |; 
Use of mat2gray brings the values to the range [0, 1] and im2uint8 brings 
them to the range [0,255].Later, in Section 3.2.3, we discuss a scaling function 
”that automatically detects the class of the input and appjlles the appropriate 
conyversion. 
Thefunction shownin Fig.3.4(ay is called a comtrast-stretching transforma- 
-tion function because it comPresses the input levels lower than mm into a nar- 
TOW Tange of dark levels in the output image; similarly, it compresses the 
values above ma into anarrow band of light levels in the output.The result is an 
image of higher contrast, In fact, in the limiting case shown in Fig. 3.4(b)j, the 
.output is a binary image. This jimiting function is called a tarespnotding func- 
fion, Which, as we discuss in Chapter 10, is a simple tool used for image seg- 
:mmentation. Using the notation introduced at the beginning of this section, the 
”function in Fig. 3.4(a) has the form 
二 
] 十 (pa/r) 


Where represents the Intensities of the inpnut image, sy the corresponding in- 
tenslty values in the output image, and 无 controls the slope of the function 
Jihis equation is implemented in MATLAB for an entire image as 


3 二 了 (7r) 








Produces aresult similar to (but with more gray tones than) Fig.3.3(c) by compress- g9=1./(+(m./double(f) + eps)).^E) 
imgtheiowend and expanding the high end of the gray scale [see Fig.3.3(d)]，。 轩 

各 入 Y 由 是 。 二 本 泛 5 

4 Logarithmic and Contrast-Stretching Transformations 是 5 = To 5 = TOn) 


Logarithmic and contrast-stretching transformations are basic tools for dy- 证 
namilc range manipulation. Logarithm transformations are implemented using 利 
the exXpression 


g = cxrliog(1 + doublef(f)) 





Dark -一 Light 
Dark -< Light 


where c 1S a constant, The shape of this transformation is similar to the gamma 
cuUIVe shown in Fig.3.2(a) with the low values setat0 and the high values setto 和 退 
L on both scales, Note. however,that the shape of the gamma curve is variable， 浊 
whereas the shape of the log function is 丰 ixed. 证 
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FIGURE 3.4 

(aj Contrast- 
Stretching 
transformation， 
(bj Ihresholding 
transformation， 
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了 EXAMPLE 3.2: 
Using alog 
transformaation to 
reduce dynamic 
range. 


a b 

FICURE 3.5 (ay) A 
Fourier Spectrum. 
(b) Resuit 
obtained by 
Pertorming a ljog 
transftormation， 


Note the use of eps (see Table 2.10) to prevent overflow 让 f has any 0 values 
Since the Jimiting value of 7(r) is 1.output values are scaled to the range [0,]]: 
when working with this type of transformation. The shape in Fig. 3.4(a) was : 


obtained with E = 20. 


旺 Figure 3.5(a) is a Fourier spectrum with values in the range 0to 1. X 10% 


displayed on a linearly scaled, 8-bit system. Figure 3.5(b) Shows the result ob- 


talined using the commands 


>> 日 = 


im2uint8(mat2gray(1ogf(1 + double(f)) )); 
>> Imshowfg) 


The visual improvement of g over the original image is qulte evident. 如 


2423 Some Utility M-Functions for Intensity Transformations 
In this section we develop two M-functions that InCOTPorate various aspects 


of the intensity transformations introduced in the Prevlous two Sections. We 晤 
Show the details of the code for one of them to illustrate error checking,to : 
Introduce ways in which MATLAB functions can be formulated so that 3: 
they can handle a variable number of inputs and/or outputs, and to show 。 
typlcal code formats used throughout the book. From this point on, detailed : 


code of new M-functions is included in our discussionas only when the Pur- 


pose ls to explain Specific programming constructs, to jllustrate the use of a 说 
new MAILAB or IPT function, or to review concepts introduced earlier ， 
Otherwise, only tbe syntax of the function is explained, and its code is in- 
cuded in Appendix C. Also, in order to focus on the basic structure of the 
functions developed in the remainder of the book, this is the last section in 
which we Show extensive use of error checking. The procedures that follow 
are typlcal of how error handling is programmed in MATLAB. 





















Tiandling a Variable Number of Inputs and/or Outputs 
Tb check the number of arguments input into an M-tunction we use functlon 


nmn = nargin 


which returns the actual number of arguments input into the M-function. Sim- 
jlarly function nargout is used in connection with the outputs of an M- 


function. The Syntax 1S 
n = nargout 
For exXample,SuUppose that we execnte the following M-tunction at the Prompt: 


5 T = testhv(4，5); 


8 of nargin within the body of this function would return a 2, whlle use of 
na rgout would return a 1 

0 ”Eunction nargchk can be used in the body of an M-function to check 让 the 
correct number of arguments were passed.The syntax is 


msg = nargchk(low，high，number) 


This function returns the message Not enough input parameters Lnumberis less 
than low or Too many input parameters 过 number 1 greater than high. 下 
number is between low and high (incjusive),nargchk returns an empty matrIX.A 
frequent use of function nargchk is to stop execution via the error function 让 the 
incorrect number of arguments is input. The number of actual input arguments is 
determined by the nargin function. For exampjle, consjder the following code 


fragment 
-function 6 = testhv2(X，y，z) 


error(nargchk(2， 3，nargin) ) ; 


了 yping 

>> 上 esthv2(6) ; 

which only has one input argument would produce the erIITOT 
Not enough input arguments， 


and execution would terminate. 
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.This function conyerts image f to the class specified in parameter newc1lass 
and outputs it as g. Valid values for newclass are 'uint8'，'uint16 
and'doup1le . 
Noteinthe following M-function, which we call intrans,how function OP- 
:tions are formatted in the Help section of the code,how a variable number of 
:inputs is handljed, how erTor checking is interleaved in the code, and how the 
elass of the outphit Image is matched to the class of the input. Keep in mind 
When studying the following code that vararginis a cell arIray, so its elements 


Often,it is usefujl to be abjle to write functions in whbich the number of inpnuit 
and/or output argutments ls Varlable. For this, we use the varlables varargin 
and varargout. In the declaration, varargin and varargout must be lower- 
case. For example， 


function [mn，n]j = testhv3(varargin) 


accepts a varliable number of Inaputs Into function testhv3, and 


function [varargout] = teSsthv4(m，n，Pp) 







returns 3a _ variable number of outputs trom function testhv4. I function 
testhv3 had, say, one fixed input argument, x, followed by a variable number 
of Input arguments, then 


function [m，n]j = testhv3(x，varargin) 


would cause varargin to start with the second Input argument Supplied by 
the user when the function is called. SIimjilar comments apply to varargout, 工 
IS acceptabjle to have a function in which both the number of Input and output 
arguments ls Varlable. 
When varargin is used as the input argument of a function,MAILAB sets it 
to a cell array (See Section 2.10.5) that accepts a variable number of inputs bythe 
user. Because vararginlils acell array, an important aspect of this arrangementis ， 
that the call to the function can contain a Imixed set of inputs. For example, as-“ 
suming thatthe code of our hypothetical function testhv3 is equipped to handle : 
it,i woujd be perfectly acceptable to have a mixed set of inputs, such as 
>> [m nj = testhv3(f，[I0 0.5 1.5]，A， 1Label' ); 
where f is an Image, the next argument ls a TOwW Vector of length 3,Alis ama- ， 
trix, and “Iabel' is a character String. This is indeed a Powerful feature that 
can be used to simplify the structure of functions requiring a variety of differ- 


ent inputs. Similar comIments apply to varargout. 


Another M-Function for Intensity Iransftormations 


In this section we develop a function that computes the following transforma- 
tion functions: negative,log, gamma and contrast stretching.These transforma- 
tlons were Selected because we will need them later, and also to iustrate the : 
mechanics involved in writing an M-function for intensity transformations. In . 
writing this function we use function changeclass, which has the syntax 


g = changeclass(newCc1lasSs， 二) 


人 
Po 时 
0 


function g = intrans(tf，varargin) 
- %INTRANS Performs intensity (gray-1level) transformations 


| 


Selected by using curly braces. 


Intrans 
6 = INTRANS(F， neg' ) computes the negative of input image 上 ， 


6 = INTRANS(F，'10g ' ，C，CLASS) computes Cx*l1og(1 + F) and 

multiplies the reSult by (posSitive) Constant C. If the 1Last two 
paranmeters are omitted，C defaults to 1，Because the 10g 1IS USed 
frequently to display Fourier Spectra，parameter CLASS offers the 
option to Specify the Class of the Output as "uint8' or 

Uint16 ， 革 parameter CLASS is omitted，the output is of the 

sanme Class as the input ， 


6 = INTRANS(F， gamma' ，GAM) perfornms 3a gamma transformation on 
the input image Using parameter GAM (a required ;input )， 


6 = INTRANS(F， Stretch'，M，E) Computes a contrast-Stretching 


_ transformation USing the expresSion 1,/(1 + (MA(F + 


eps)),E)，Parameter 其 must be in the range [0，1].， The default 


.Value for M is mean2{inm2double(F))，and the default value forE 


TS 4， 


For the 'neg' ， "gamma'，and 'Stretch' transformations，doub1le 
input images whose maximum value is greater than 1 are Scaled 


Tirst Using MAT2GRAY，0ther images are converted to double first 


USing IM2DOUBLE，For the '10g' transformation，double images are 
transformed Without being scaled; other images are Converted to 


-double first using IMH2DOUBLE . 


The output is of the same class as the input，except jif a 


，d 直 fferent class is specified for the 10g option， 


4 Verify the correct number of inputs ， 
error(nargchk(2，4，nargin) 


和 必 Store the class of the input for Use 1ater， 
classin = Class (于 ) ; 


和 
汪 形 寻 


/3 
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% If the input is of Class doub1e，and it is outside the range 

% f{0，1]，and the specified transformation is not “10g ，convert the 
% input to the range [0，1]， 

jif Strcmpfclass(f)， ' double ) & max(f(:)) > 1&， 


~Strcmp(vyararginf1}， 109g ) 


f = mat2grayfTf) ; 

else 5%5 Convert to double，regardleSss of Cjlass(f)， 
f = im2doublef(f); 

end 


% Determine the type of transformation Specified. 
method = Varargin{1)} 


%5 Perform the intensity transformation Specified ， 
Switch method 
case “neg- 

g = imcompIement (ff) 


CaSe “ “109 

if length(varargin) == 1] 
C = 1; 

elSeif Length(vararginj = 
C = varargin{2}; 

eljseif length(vyararginj = 
C = varargin{f21， 
Classin = varargint3l; 

elSe 
error('Incorrect number of inputs for the 10g option，) 

end 

g=crflogf(1 + double(Tf)))) 


Case “ gamma， 
if length(varargln) < 2 
error('Not enough inputs for the gamma Option， ) 
end 
gam = Varargilnft21}; 
g = imadjust(f，[ ]，{| ]，gam) 1; 


Case Stretch 
if length(varargin) == 1 
% USe defaults， 
站 = mean2(f) | 
E=4.0; 
elseif 1ength(varargin) == 3 
前 = Varargin{f21 
E = Vararginft3h， 
else error('Incorrect number of inputs for the Stretch option.，) 
end 
9=1./+ (nm ATfT+ epshih) FE) 


OtherWiSe 
errorf'Unknown enhancement method，) 
end 


Convert to the Cl1ass of the input image， 
= Changeclass(ClasSsSin，g) 


失 
TD 


让 
CD 


和 5 
8 


村， ie 

























一 国 Asanillustration of function intrans,conslder the Image InD 开 18. 3.0(a), which 
anideal candidate for contrast stretching to enhance the skeletal structure. The 
-Tesult in Fig.3.6(b) was obtained with the following call to intrans: 


> g= intrans(fT，'stretch'，mean2(im2double(f))，0.9)j 
figure，jimshow(g) 


Note how function mean2 was used to compnute the mean value of f directjy 
inside the function call. The resujlting value was used for m. Image f was con- 
.Verted to double using im2doub1le in order to Scale its values to the range 
-[0,1] so that the mean would also be in this range, as required for input m, The 


value of E was determined interactively. 六 


ee 


- An M-Function for Intensity Scaling 


-When working with images, results whose Pixels span a wide negative to posi- 
:tive range of values are common, While this presents no problems during in- 
-termediate computations, it does become an isSSue when we want to Use am 
“8-bit or 16-bit format for Saving or Viewing an image, in which case it often ij 
:desirable to scale the image to the full, maximum range, [0, 255] or [0, 655335|. 
:he following M-function, which we call gscale, accomplishes this. In addi- 
tion, the function can map the output levels to a specified range. The code for 
:this function does not incjude any new concepts So we do not include it here. 
-See Appendix C for the listing. 
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EXAMPLE 3.3: 
Illustration of 
function intPFans. 


m = mean2 (AI) 
COH11DHeS Le 1TeGH11 
faveragej Patfiie OF 
pe elemazer1s OF 
PP 人 


FIGURE 3.6 (al) 
Bone scan 1mage. 
(b) Image 
enhanced using a 
contrast-stretchling 
transformation. 
(Original Image 
courtesy of GO, 下 . 
Medical Systemas.,) 
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The syntax of function gsScale 1S 
g = gscale(f，nmethod，1Low，haigh) 


where fis the image to be scaled.Valid values for method are 'fu118' (the de- 
fault), which scales the output to the full range [0, 255], and 'fu1116 which 
scales the output to the fll range [0, 6$5335|. 开 imcluded, parameters Low and 
high are ipgnored in these two converslions. A_ third valid value of method 1 
minmax' ,in which case parameters 1ow and high,both in the range [0,1,mnust 
be provided. If 'minmax' ls Selected, the levejls are mapped to the range [1ow， 
high].Although these values are Specified in the range [0, 1],the Program Per- 
forms the proper scaling, depending on the class of the Input, and then converts 
the output to the same class as the input. For example,iffls of class uint8 and 
we Specify 'minmax' with the range [0, 0.3], the output also will be of class 
uint8, with values in the range [0, 128|. 玉 fis of class double and its range of 
values js outside the range [0, 1j, the Program converts it to this range beftore 
Proceeding. Function gscale 1 used in nUImeIous Places throughout the book， 





3 Histogram Processing and Function Plotting 


Intensity transformation functions based on information extracted from image 
intensity histograms play a basic role in image Processing, In areas SUch as en- 
hancement, compression, segmentation, and description. IJhe focus of thls Sec- 
tion ls on obtaining, plotting, and using histograms for image enhancement， 
Other appPlications of histograms are discussed In later chapters. 


3.3. 1 Cenerating and Plotting Image Histogramas 


The histogram of a digital image with 元 total possible intensity levels in the 
range [0, GC] is defined as the discrete function 


及 六) = 邮 K 


where rm is the Kth intensity level in the interval [0, C] and mw is the number of 
Pixels in the image wbhose intensity level is 六 .The value of G is 2$$ for images of 


class Uint8,633335 for images of ciass uint16,and 1.0for images of class doub1e， 
Keep in mnind that indices in MAITILAB cannot be 0, so mcorresponds to Intensl- 
ty jevel 0, 一 corresponds to intenslty level 1, and so on, with 产 corresponding to 
level C. Note aljso thatC = 世 一 1forimapges ofclass uint8 and Uint16. 


Often, it is useful to work with mormzalized histograms, obtained simply by 汝 
dividing all elements of 由 (六 ) bythe total number of pixels in the image, which 


we denote by 六 : 




















ar 
全 全 记 人 和 rr 


人 
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0 for 大 = 1 2,..., 志 .From basic probability we recognize D( 上 六) as an estimate 
”ofthe probability of occurrence of intensity level 六 ， 
-The core functon In the toolbox for dealing with image histograms is 
imhist,which has the following basic syntax: 
h = imhist(f，b) 
Wherefis the input image,h is its bistogram, 有 (六 ),and bis the number of bins 
used in forming the histogram (if b is not inclauded in the argument,b = 256 is 
”used by default).A bin is simply a subdivision of the intensity scale. For exam- 
-ple, 这 we are working with uint8 images and we letb = 2,then the intensity 
“scale js subdivided into two ranges: 0 to 127 and 128 to 255.The resulting his- 
.togram will have two values: An(1) equal to the number of pixels in the image 
With yalues in the interval [0,127],and zf(2) equalto the number of pixels with 
:Values in the interval [128,255]. We obtain the normalized histogram simply by 
Using the expression 


P = imhist(f，b)yVnumelL( 了 f) 


Recall from >ection 2.10.3 that function numelL(f) gives the number of ele- 
2 ments in arTray f (ie the number of pixels in the image). 


嘲 Consider the image, f, fom Fig, 3.3(a). The simplest way to Plot its his- 
-togram ls to use imhist with no output specified: 


一 >> imhist(f); 


了 gure 3.7(a) shows the result.This is the histogram display default in the tool- 
box.However,there are many other ways to plot a histogram, and we take this 
“opportunity to explain some of the plotting options in MATLAB that are Tep- 
Tesentatiye of those used in image processing applications. 

”于 stograms often are plotted using par graphs. For this puUrPose we can UsSe 
”the function 


bar(horz，vV，width) 


where v is a row vector containing the points to be plotted, horz ls a vector 
of the same dimension as v that contains the increments of the horizontal 
Scale,and width is a number between 0 and 1.If horz is omitted, the hori- 

2Zontal axis is divided in units from 0 to lengthtv). When width is ]l, the 
bars touch; when it is 0, the bars are Simply vertical lines, as in Fig. 3.7(a). 
”Jihe default value is 0.8. When plotting a bar graph,it is customary to reduce 
局 the resolution of the horizontal axis by dviding it into bands. The following 
”statements produce a bar graph，with the horizontal axis divided into 
”oups of 10 levels: 


ai 
ne 
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了 上 及 AMPLE 3.4: 
Computing and 
piotting image 
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histograms. 
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冯 改 

C 

FIGURE 3.7 
Varlous ways to 
plot an Image 
histograrn. 

(a) imhist， 

{(b) bar， 

(c) Stem 

(d) P1Lot， 








X 10; 








14000 14000 
12000 12000 
10000 10000 
8000 8000 
6000 6000 
4000 4000 
2000 2000 
0 0 
0 50 100 150 200 230 和 50 100 130 200 230 
>>h= Imnhist(f); 
>> h1 = h(1:10:256) | 
>> horz = 1:10:256 | 


>> bar(horz，hti) 

>> axis([0 255 0 15000] ) 

>> Set(gca， 'Xxtick'，0:50:255) 

>> Set(gGca， 'ytick'，0:2000:15000 ) 


Figure 3.7(b) shows the result.The peak located at the high end of the intenst- : 


ty Scale in Fig. 3.7(a) is missing in the bar graph as a result of the larger hori 
zontal increments used in the plot， 


The fifth statement in the preceding code was Used to expand the lower 1 


range of the vertical axis for visual analysis, and to set the orizontal axis to the 
Same Iange as in Fig.3.7(a). The axis function has the Syntax 


axis([horzmin horzmax Vertmin Vertmax ] ) 


which sets the minimum and maximum values in the horizontal and Vertical : 
axes. In the last two statements, gca mmeans“get cUITent axis,”(i.e., the 3aXeS of 
the figure last displayed) and xtick and ytick set the horizontal and vertical 


axes ticks in the intervals Shown， 


Axis labels can be added to the horizontal and vertical axes of a graph Using 


the functionas 















过 站 
蕊 各 


XJLabelI( 'text string '， 
YIabel( text StpPing ' ， 


于 ontsize ，SLze) 
fontSlize ' ，Size) 


”where size ls the font Size ln Points. TIext can be added to the body of the fig- 
ure byusing tunction text, as follows: 

text(xlLloc，y]10c， "text String'"， 'fontSize'，Size) 

Where xloc andyloc define the location where text starts. Use of these three 
fanctions ls lllustrated In Example 3.5. It is important to note that functions 
that set axXls Values and jabels are used after the function has been plotted. 
Atitie can be added to a plot using function tit1e, whose basic Syntax is 


tit1le( titLestring  ) 


Where titlestring is the strimng of characters that will appear on the title， 
centered above the plot 
Asyteri graph ls simjlar to a bar graph.TIhe Syntax is 





Stem(thorz，VvV， Color 1Linestyjle _ marker ， 'filL') 
where Vv ls roOwW vector containing the Points to be plotted, and horz is as de- 
Scribed for bar.The argument， 


Color 1Linesty1le_marker 


.atriplet of values from Table 3.1 For example, stem(V，'r 一 -~S' ) Produces 
astem plot where the lines and markers are red, tbhe jlines are dashed, and the 
_markers are Squares. 由 fillis used, and the marker is a circle, Square, OoT dia- 
Imond, the marker is filled with the coior specified in color.The default color 
本 black, the line default is so1id, and the default marker is a circle. The 
2 stem graph in Fig. 3.7(c) was obtained using the statements 


2 h = imhist(f)， 


Color Marker 


Black 
White 
Red 


Symbol 


Symbol Line Styie 


Solld 
Dashed 
Dotted 
Green Dash-dot 
Blue No line 
CCyan S 
Yelljow d 
Magenta none 


Symbol 


Plus slgn 
Circle 
AsterTlsk 
Point 
CrosS 
quUare 
Diamond 
No marker 
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TABLE 3.1 
Attributes for 
functions stem and 
plot. TIhe none 
attribute 1 
applicable only to 
function plot,and 
must be specified 
Individualiy See the 
syntax for function 
plLot below. 
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了 119256， 
' 咎 I11 ) 


horz = 
steml(horz，h1， 


>> axis([0 255 0 15000] ) 
>> Set(gca， 'Xxtick' ，[0:50:255] ) 
>> Set(gca， 'ytick'，[0:2000:15000] ) 


Finally we consider function plot, which plots a set of points by linking - 


them with straight lines. The syntax js 


plLlot(horz，VvV， 'Color 1inestylLe_nmarker ) 


whbere the arguments are as defined previously for stem plots. The values of ， 
color,1inestyle,and marker are given in Table 3.1.As in Stem, the attributes 
in pLot can be specified as a triplet, When using none for Linestyjle or for : 
marker,the attributes must be specified individually For example,the command 


>> plot{horz，VvV， 'color'，'g' ，'1Linestyle' ， none ， marker ， SS ) 


plots green squares without connecting lines between them. The defaults for 


plot are sollid black jines with no marKerTs. 
Thbe plot in Fig,3.7(d) was obtained using the following statements: 


>> h = Imhizst( 于 ) ; 

>> plLot(Dh) 5%5 Use the default values， 
>> axlis([0 255 0 15000] ) 

>> Set(gca，'xtick' ，f{0:50:255] ) 

>> Set(gca， 'ytick' ，[0:2000:15000] ) 


Function plLot is used frequently to display transformation functions (See 1 
Example 3.5)， 漂 . 


In the preceding discussion axis limits and tick marks were set manually. 并 : 
is possible to set the limits and ticks automatically by using functions ylLimand 


xl1im, which, for OoUT Purposes here, have the Syntax forms 


YJim( auto ) 
XJlLim( auto '  ) 


Among other possible variations of the Syntax for these two functions (See on- 


line help for details), there ls a manual option, given by 


ymax] ) 
Xxmax ] ) 


YLim([yYmzan 
XLEimt(TIxmzn 


which allows manual specification of the limits. If the limits are specified for 


only one axis, the limits on the other axls are Set to ' auto'" by default. We USe 


these functions in the following section， 






















:Typing hold on at the prompt retains the current plot and certain axes 
_properties So that subsequent graphing commands add to the existing graph. 
See Example 10.0 for an ilustration. 


3.2 Histogram Equalization 


ssume for a moment that intensity levels are continuous quantities normal- 
jzed to the Tange [0, ]j, and let 疡 (r) denote the probability density function 
(PDP) of the intensity levels in a given image, where the subscript is used for 
differentiattng between the PDFs of the input and output images. Suppose 
that we Perftorm the foliowing transformation on the inpnut levels to obtain 
utput (processed) intensity levels, sy， 


S 一 了 (r) = 上 Di020) do0 


Where 2 1 a dummy variable of integration, It can be shown (Gonzalez and 
Woods [2002j) that the probability density function of the output levels is 
Lijorrzi that lsS， 

1 for0<y=1 
Pr(s) = 


10 otherwise 


mm other words, the Preceding transformation generates an image whose in- 
.tenslty levels are equally likely and,in addition, cover the entire range [0, 1]. 
“The net result of this intensity-level egualizatiomn process js an image with in- 
“creased dynamic range, which will tend to have higher contrast. Note that 
2 the transformation function is really nothing more than the cumulative dis- 
:tribution function (CDFE)， 
When dealing with discrete quantities we work with histograms and call 
the preceding technique Aistograma equalization, although, in general the 
Distogram of the processed image will not be uniform, due to the discrete na- 
。 ture of the variables. With reference to the discussion in Section 3 
5 Pr/ = 1,2,... ,ZL,denote the histogram associated with the intensity lev- 
”els of a given image, and recall that the values in a normalized histogram are 
apPIOXImationas to tbhe probability of occurrence of each intensity level in the 
Image. For discrete quantities we work with summations, and the equaliza- 
瀑 ， thon transformation becomes 


SK 一 


人 K = 1 2,... ,7L, where sw is the intensity value in the output (Processed) 
mage corresponding to value 产 in the input image. 
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1 


histeq 


E 上 EXAMPLE 3.S: 
Histogram 
equajlization， 


ANOVector 
B=cumsumfA) 
BE3S 1Pc GD OF 
ele1Temi FADUG 
PERereiiaemsiometl 
077G 

忠 = CuUmsumtA, dim》 
81Vem IIe YOUNGLOHE 
妇 e dlensiom yDect- 
Jead py dim， 


Cumsun 


Histogram equalijzation is implemented in the toolbox by function histed， 
whlch has the SyntaxX 


g = histeq(f，nlev) 


where f ls the input image and nlev is the number of intensity jevels specified 
tor the output image. 于 nlev is equal to 工 (the total number of possiblie levels 
In the Input image)j, then histeq implements the transformation function， 
ZU directly. 王 nLev is less than L, then histeq attempts to distribute the 
jeveis so that they will approximate a flat histogram. Unlike imhist, the de- 
fault value in histeqis nlev = 64. For the most part, we use the maximum 
possible number of levels (generally 256) for nlev because this produces a 
true Implementation of the histogram-equalization method just described. 


脐 Figure 3.8(a) is an electron microscope image of pollen， magnified approx- 
Imately 700 times. In terms of needed enhancement, the most important fea- 
tures of thls Image are that it is dark and has a Ilow dynamic range.This can be 
Seen ln the histogram in Fig.3.8(b)j,in which the dark nature of the Image 1S exX- 
pected because the histogram is biased toward the dark end of the gray scale， 
The ljow dynamic range is evident from the fact that the“width”of the his- 
togram lS narrow with respect to the entire gray scale, Letting f denote the 
input image, the foillowing Sequence of steps produced Figs.3.8(a) through (d): 


>> mshow(fTf) 

>> figure，imhist(f) 
>> yYJim( auto ') 

>>g = histeq(f，256) ; 
>> figure，imshow(g) 
>> figure，imhist(g) 
>> ylLim( 'auto ' ) 


The Images were saved to disk in tiff format at 300 dpi using imwrite,and the 
pPjots were Simijlarly exported to disk using the print function discussed in 
ection 2.4. 

The Image in Fig. 3.8(c) ls the histogram-equajized result. The improve- 
ments ini average intensity and contrast are quite evident, These features also 
are evident in the histogram of this image,shown jn Fig.3.8(d).The increase in 
contrast ls due to the considerable spread of the histogram over the entire in- 
tenslty Scale. Jhe increase in overall intensity is due to the fact that the average 
intensjty level in the histogram of the equalized image is higher (lighter) than 
the orlginal, Although the histogram-equalization method just discussed does 
not produce a flat histogram',it has the desired characteristic of being able to 
Increase the dynamic range of the intensity levels in an jimape. 

Asnoted earlier the transformation function 7(myis simply the cumulative 
sum of normalized histogram values. We can use function cumsunm to obtain the 
transtormation function, as follows: 


>> hnorm = Imhist(f) .Anumel(f) ， 
>> Codf = CUmsum(hnormy) ; 











>> X = linspace(0，1，256) ， 


>> 
>>>> 
六 > 


> 


>> 
>>> 





pJot(X，cdf ) 
axis([0 101]) 


Set(gca， Xxtick ，0:.2:1) 
Set(gca， ytick' ，0:.2:1) 
XJlabel( ' Input intensity Values` ， fontSize ，9) 
yjlabel(' Output intensSity values'  ， 'fontSize ，9) 
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Aplot of cdf,shown in Fig. 3.9, was obtained using the following commands: 


% Intervals for [0，1] horiz Scale，Note 
% the Use of linspace from SeC，2.8.1， 

% Plot Cdf VS ，X， 

点 Scale，sSettings，and labe1s: 


>> 所 Specify text in the body of the graph: 


定 >> 


We can tell visually from this transformation function that a narIow Tange of 
Input intensity levels is transformed into the full intensity scaje ln the output 


text(0.18，0.,5， Transformation function'  ， fontsize ，9) 


Image， 








ab 

ed 

FGURE 3.8 
Jilustration of 
histograrm 
equallzation. 

(a) Input image， 
and (bj its 
hlstograrm. 

(c) Histogram- 
equallized 1mage， 
and (d) lts 
hlistogram. TIhe 
Improvement 
between (a) and 
(cj) is quite visible. 
(Original image 
coOUrtesy of DT， 
Roger Heady， 
Research School 
of Biological 
Sclences， 
Australlian 
National 
University， 
CanberTra.) 
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FIGURE 3.9 
Transformation 
function used to 
map the Intensity 
”vajues from the 
input image in 
Fig.3.8(aj to the 
values of the 
output Image in 


Pig. 3.8(c). 
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3.3.3 Histogram Matching (Specification) 


Histogram equalization produces a transformation function that is adaptive, ip 
the sense that lt ls based on the histogram of a given image. However, once the 
transformation function for an image has been computed,it does not change un- 
less the histogram of the image changes. As noted in the Previous section, his- 
tograrm equallzation achieves enhancement by Spreading the levels of the inpnut 
Image over a wider range of the intensity scale. We show in thjs section that this 
does not always lead to a successful result. In particular,it is usefujl in some appli- 
cations to be able to specify the shape of the histogram that we wish the 
Processed Image to have. The method used to generate a processed image that 
has a specified histogram js called jistograma maatcping or Pistogra1t spDeciFcatio1 

The method is simple in Principle. Consider for a moment continuous levels 
that are normajlized to the interval [0, 1], and let r and z denote the intenslty 
levels of the input and output images. The input levels have probability densi- 
ty function Pr) and the output levels have the specified probability density 
function pz(z), We know from the discussion in the previous section that he 
transftormation 


:TCD = 人 Po)du 


resujts in Intensity levels, s, that have a uniform probability density function， 
Pi(5). SUPPose now that we define a variable z with the property 


Ha = /mlo)ao=s 


















Keep in mind that we are after an image with intensity levels z, which have the 
-specified density p.(z). From the preceding two equations,ilt follows that 


z=(s)= 瑟 TOn] 


We can find T(r) from the input Image (this ls the histogram-equalization 
ransformation discussed in the Previous Sectionj, so it follows that We can USe 
-the preceding equation to find the transformed levels z whose 了 DEFE 1S the SPpeC- 
“证 ed p.(z), as long as we can find 妃 下. When working with discrete Varlabies， 
"we can guarantee that the inverse of 吾 exists ii pz(z) is a valid histogram (ie.,， 
“ithas unit area and all its values are nonnegative),andnone of its components 
-zero [Le.,no bin of p:(z) is empty].As in histogram equalization， the discrete 
implementation of the preceding method only yields an approximation to the 
-specified histogram， 
The toolbox Implements histogram matching using the followimng Syntax In 
-histeq: 


g = histeqf(fT，hspec) 


”Where fis the input image, hspec is the specified histogram (a row vector of 
2 specified valuesj, and g is the output image, whose histogram approximates 
the specified histogram, hspec. This vector should contain integer counts cor- 
-Tesponding to equally spaced bins.A property of histeq is that the histogram 
:of g generaly better matches hspec when length(hspec) 18 much Smajler 
-than the ntmber of intensity levels in ff. 


一 杜 Figure 3.10(a) shows an image，f，of the Mars moon，Phobos，and 
Hg.3.10(b) shows its histogram, obtained using imhist(f).The image is dom- 
汗 ”，inated by large, dark areas, resulting in a histogram characterized by a large 
后 “concentration of pixels in the dark end of the gray scale. At first glance, one 
Imight conclude that histogram eqnualization would be a good approach to en- 
hance this image, so that details in the dark areas become more visible. How- 
eyer, the result in Fig. 3.10(c), obtained using the command 


> f1 = histeq(f，256); 


Shows that histogram equalization in fact did not produce a Particularly good 
TIesujlt in this case. The reason for this can be seen by studying the histogram of 
the equalized image,shown in Fig.3.10(d). Here, we see that that the intensity 
levels have been shifted to the upper one-half of the gray scale, thus giving the 
Image a washed-out appearance.The cause of the shift is the large concentra- 
tion of dark components at or near 0 in the original histogram. In turn, the cu- 
mujative transformation function obtained from this histogram is steep, thus 
了 mapping the large concentration of pixels in the ljow end of the gray Scale to 
了 thehigh end of the scale. 
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卫 XAMPLE 3.6: 
Histogram 
matching. 
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FIGURE 3.10 

(aj) Image of the 
Mars moon 

了 hobos. 

(b) Histogram. 
(c) 于 istogram- 
equajized image. 
(d) Histogram 
of (c)， 

(Original image 
COUItesy of 
NASA)， 





50 100 


One possibjlity for remedying this Situation is to use histogram matching， 


with the desired histogram having a lesser concentration of components in the 浊 


low end of the gray scale, and maintaining the general shape of the histogramm 
of the original image. We note from Fig.3.10(b) that the histogram is basically 
blimodal, with one large mode at the origin, and another, smaller mode at the 
high end of the gray Scale. These types of histograms can be modeled, for ex- 
ample, by using multimodal Gaussian functions. The following M-function 
compnutes a bljmodal Gaussian function normalized to unit area, so it can be 
used as a Specified histogram. 


function pP = twomodegauss(m1，sSig1，m2，sSig2，A1，A2，kK) 

s%TWOMODEGAUSS Generates 3a binmodal Gaussian function， 

P = TWOMODEGAUSS(MI ，SIG1，W2，SIG2，A1，A2，K) generates a bimodal， 
6auSsian-like function in the interval [0，1]，P is a 256-element 
Vector normalized So that SUM(P) equals 1，The mean and standard 
deviation of the modes are {Mf，SIG1) and (M2，SIG2)， resSpectively， 
Al and A2 are the amplitude values of the two modes，Since the 


twomodegauSS 
沽 将 


oo oo ge oo 


Ge 
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output is normalized，only the reJlative Values of A1 and A2 are 
important, K 1S an offset Value that raises the“floor"” of the 
function，A good Set of values to try is M = 0.15，SIG1 = 0.05， 
M2 = 0.75，9IG2 = 0.05，A1 = 1，A2 = 0.07，and K = 0.002， 


AT 人 (人 (2 pi 0.5) “51917) 
2 (LS191 2 
A2*x(11/((2*p) 0.5) * Sig2)， 
2 5 区 22) 


让 


让 


= 1inspacef{f0，1，256) ; 
kt+clr* explt-((Z 一 mi) ,2) ./ KK) + .，. 
C2 * exp(-((Z 一 n2) .2) ,/ K2); 


| 


p ,/ Sum(p{:)); 










Thefolowing interactive function accepts inpnuits from akeyboard and plots 
the resujting (ausslan function. Refer to Section 2.10.5 for an explanation of 
the functions input and str2num.Note how the limits of the Plots are set. 


manualIh1lLst 


function P = manualhist 
rr 


ANUALHIST Generates a bimodal histogram jinteractively. 

P = MANUALHIST generates a bimodal histogram USing 
TWOMODEGAUSS(m1T，Sig1，m2，Sig2，A1，A2，k)，ni and m2 are the means 
of the two modes and must be in the range [0，1]. sig1 and Sig2 are 
the Standard deviations of the two modes，A1 and A2 are 

ampjljitude valueSs，and k ls an offset value that raises the 

了 了 10or ”of histogram，The number of elements lin the histogram 

Vector P 1s 256 and Sum(P) is normalized to 1，NMANUALHIST 

repeatedly prompts for the parameters and plots the resulting 
histogram until the USer types an 'X' to quit，and then it returns the 
last histogram Computed， 

A good Set of starting values is: (0.15，0.05，0.75，0.05，1 ， 
0.07，0.002 )， 


于 ITnitialize， 
repeats = true; 
guitnow = X 
Compute a default histogram in Case the user quits before 
各 estimating at least 0ne histogram， 
Ps twomodegauss(0.15，0.05，0.75，0.05，1 ， 


Cycle until an X is input， 
While repeats 
.3Sz=input('Enter ml，sig1， 
if S == quitnow 

break 
end 


s Convert the input String to a vector of numerical values and 
% Verify the number of inputs， 

= Str2numl(S) ; 
if numel(vy) -= 7 


0.07，0.002); 


= 人 


nm2，Sjig2，A1，A2，K 0OR X to quUit; ， 
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disp( Incorrect number of inputS，) 
continue 
end 


Figure 3.11(b) shows the result, The improvement over the histogram- 
equalized resujt in Fig. 3.10(c) is evident by comparing the two images. It is of 
:iterest to note that the specified histogram represents 3a rather modest 
change from the orlginal histogram. This is all that was required to obtain a 
significant improvement in enhancement. The histogfram of Fig. 3.11(b) is 
shown in Fig.3.11(c). The most distinguishing feature of this histogram is how 
ts low end has been moved closer to the lighter region of the gray Scale, and 
hus closer to the specified shape. Note, however,that the shift to the right was 
not as extreIne as the shift in the histogram shown in Fig.3.10(d), which corre- 
:sponds to the poorly enhanced image of Fig.3.10(c). 器 


p = twWomodegauss(v(1)，V(2)，Vv(3)，Vv(4)，V(5)，V(6)，V{7))) 
% Start a new figure and Scale the axes，Specifying onlLyY xlin 
% Jeaves ylLim on aut0， 
figure，pJlot(p) 
xjliml([0 255] ) 
end 3 


yince the Problem with histogram equalization in this example is due Pri- 
marlly to a large concentration of Pixels in the original image with jevels near (0 
a Teasonable approach is to modify the histogram of that imagpe so that it does 
not have thls Property. Figure 3.11(a) shows a plot of a function (obtained with 
Program manualhist) that preserves the general shape of the original his- 
togram，but has a Smoother transition of levels in the dark Tepgion of the 
Intenslty scale. The output of the program, p, consists of 2S6 equally spaced 
points from this function and is the desired specified histogram. An jimage with 
the specified histogram was generated using the command 





区 号 晤 Spatial Filtering 

As mentioned in Section 3.1 and illustrated in Fig. 3.1, neighborhood process- 
ing consists of (1) defining a center point, (x, y); (2) performing an operation 
hat involves only the pixels in a predefined neighborhood about that center 
oint; (3) letting the result of that operation be the “Tesponse”of the PTOceSS 
t ipat point; and (4) repeating the process for every point in the Image. JIhe 
“process of moving the center point creates new neighborhoods, one for each 
:Dixel in the input image. The two Principal terms used to identify this opera- 
tion are 7ieigpporpood processipng and spalial jitering, with the second term 
:being more preyalent. As explained in the following section, if the computa- 


>> 9 = histeq(f，p); 


已 0.02 tions performed on the pixels of the neighborhoods are linear, the operation is 
总 :called jiearsPpatal jiliering (theterm ypatial convolution also used); otherwise 
FIGURE 3.11 0.015 :tis called momlizaear Spatial jirering. 
(a)j Specified 

hilistogram. 3 2 
(b) Re ii of 0.01 34. Linear Spatial Filtering 
es by The concept of iear /Piering has its roots in the use of the Fourier transform 
人 0.005 for signal Processing in the frequency domain, a topic discussed in detail in 
(c) Histogram Chapter 4. In the present chapter, we are interested in filtering operations that 
of (b)， 0 are performed directly on the Pixels of an image. Use of the term jzearspatiat 


0 50 100 150 。 200 250 


ering differentiates thijs type of process from HPreguency doraazn Jiering. 
| TIhe linear operations of interest in this chapter consist of mujtiplying each 
“pixel in the neighborhood by a corresponding coefficient and SumIDing the re- 
sults to obtain the response at each point (x, y). 入 the neighborhood is of Size 
111lcoefficients are required,Jhe coefficients are arranged as a matrix， 
scaled a Fier, rasK iiiermaasK Kerael ie1tpDlate,or Windom, with the first three 
terms being the most prevalent. For reasons that will become obvious Shortly， 
the terms comvotwtior Fiter 44K or Kerzzel also atre Used. 
-The mechanics of linear spatial filtering are illustrated in Fig. 3.12. The 
.Frocess conslsts Simply of moving the center of the filter mask WwW from point to 
”point in an image, 六 At each point (zx, y), the response of the filter at that 
point is the sum of products of the filter coefficients and the corresponding 
”neighborhood Plxejls in the area Spanned by the filter mask. For a mask of size 
一 玫 类 1 weassumetypicallythatzjz =24+1and 关 =20+1whereaandD 
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Figure 3.13(a) shows a one-dimensional function, 六 and a mask, .Tbpe ofi- 
in of 六 is assumed to be its leftmost Point. To Perform the Correlation of the 
two functions, we move W so that its rightmost point coincides with the origin 
if 六 as Shown in Fig. 3.13(b). Note that there are points between the two func- 
ions that do not overlap. The most common way to handle this problem is to 
“pad 太 with as many 0s as are necesSary to guarantee that there wi 册 always be 


FICURE 3.12 The Image Origin 
mechanlcs of ]inear 泣 

Spatial fjltering. 
The magnified 
drawing Shows a 

3 X 3mask and 
the corresponding 





image orresponding points for the fall excursion of z past 太 .This situation is shown 
nelghborhood 下 攻 记 和 13(G] 

directly under it 卫 、 ee 
The neighborhood We are now Ieady to perform the correlation. The first value of correlatlon 


gthe sum of Products of the two functions ip the position Shown In 
ipg.3.13(c).The sum ofproducts is 0 in this case. Next, we move W one location 
o the right and repeat the process [Fig.3.13(d)]. TIhe sum of products again 1S 
.After four shifts [Fig. 3.13(e)], we encounter the first nonzero value of the 
orrelation, which is (2)(1) = 2. 玉 we proceed in this manner Untl 2 IOVes 
ompjletely past 矿 [the ending geometry is shown in Fig. 3.13( 昌 ] we would get 
he result in Fig.3.13(g).This set of values ls the correjlation of 2 and 太 Note 
hat, had we left W stationary and had moved 矿 past W instead, the resuit 
ould have been different, so the order matters， 


1S Shown displaced 
out trom under the 
Inask for ease of 
readability. 









Image 太 c y) 












0(1, 一 了 20(10) 2(1, 1) Correlation Convyolution FIGUR5 3.13 
Ilustration of 
全 区 - Origin 六 包 Origin 大 2 rotated 180"7 one-dimenslional 
Fr-l.y-1D | Alz-1l 要 Wi Ac 
2 00010000 12320 00010000 02321 人 correlationand 
convolution. 
foy-D | ye 六 | Fey+D 00010000 00010000 人) 
12320 02321 


Starting position alignment 


2 Zero padding 






7 人 (x+l,y+1l) 












cj0000000100000000 0000000100000000 (6 
12320 02321 
人 oo0o00o00100000000 00000001000000000 
2320 02321 
Position after one shift 
。 四 -ooooooo1loooooo0o0o0 0000000100000000 mn 
are nonnegative integers.Ajllthis says is that our Principal focus is on masks of . 本 ) 12320 02321 


odd sizes, with the smallest meaningful size being 3 X 3 (we exclude from ouUT 


人 ， . + Position after four shifts 
discussion the trivial case of a 1 X 1 mask).Although it certalinly 18 Dot a Te- : 


quirement, wofrking with odd-size masks is more intuitiVve because they have 3. 人 扑 0000000100000000 0000000100000000(0n) 
Unique center PoOint. SA 旦 本: 王 " 过 02321 
There are two closely related concepts that must be understood ciearly ， Final position 寺 

when performing hnear Spatlal fjterling. One 18 COrrelatomi the other 1$ 人 人 

cOHVOLUHOoP. Correlation is the process of passing the mask W by the Image : 四 000023210000 000123200000 (o) 

array 六 in the manner described in Fig. 3.12. Mechanically, convolution is the 人 

same piocess, except that W is rotated by 180? Prior to passing it by 太 These 出 … same' correlation Tesult 这 二 ES 人 
(0h) 00232100 01232000 (p) 


two concepts are best expljained by Some Simpje examples. 
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The label 'ful1L' in the correlation shown in Fig.3.13(g) is aflag (to be dis Padded FIGURE 3.14 
cussed later) used by the toolbox to indicate correjlation using a padded Image 000000000 Illustration of 
and computed in thbe manner just described. TIhe toolbox provldes another op- ， two-dimensional 
tion, denoted by ' same' [Fig. 3.13(h)] that produces a correlation that is the Origin of Fl 人 Re Correlation and 

， 多 人 六 000000000 Welinii ai 人 WE 
same Size as 太 .This computation also uses zero padding, but the starting posi- 000010 000010000 和 
tion is with the center point of the mask (the point labeled 3 in W) aligned with ， | ， 1 gray to simplify 
the origin of 六 The last computation 8 with the center point of the mask 6 090000000 Viewing， 
aligned with the last point in 三 00000 789 D00000009 ee 

To perform convolution we rotate W by 180” and place its rilghtmost point at (al) (b) 
the origin of ,as shown in Fig. 3.13(). We then repeat the Sliding/computing : 有 于 
process emmployed in correlation, as jillustrated in Figs.3.13(k) through (n).The - SI we 和 站 Re | | 
1 4 因 人 风 中 外 昌 各 ) 人 人 站 

ful1 and 'Ssame' convolution results are Shown in Figs.3.13(o) and (P),re- 45 人 000000 Re 
spectively. 789000000 本 065460 

Function 太 in Fig.3.13 is a discrete unit impulse function that is 1 at one 人 ， 0000089 ， 00987000 32 
location and 0 everywhere else. It is evident from the result in Figs.3.13(o) or | 。 1 ， 1 人 
(P) that convolution basically just“copied”?o at the location of the impulse. 000000000 直人 
This simple copying property (called s 太 ziz8) is a fundamental concept In jin- 人 人 000000000 

.， : RE 0000090039 000000000 
ear System theory and it is the reason why one of the functions 1S always ro- 人 
tated by 180” in convolution. Note that，unlike correlation, Tevertsing the (9) (e) 
order of the functlons yields the Same convyolutlon TeSult. If the function 二 生生 和 
being shifted is symmetric, it is evident that convolution and correlation 90957000000 000000o000 0oao0 
yield the same resujt. 1 5 4 000000 DO0000900000 01230 
The preceding concepts extend easily to images, as jllustrated in Fig. 3.14. 人 ， E ， ， ， 
WE 5 3 【和 全 四 和 罗 了 [ 
The origin is at the top, left corner of image 用, )) (See Fig. 2 To Pertorm - 000010000 000456000 00000 
correlation, we place the bottom, rightmost point of W(xz, 》) So that it coin- 0000000008 和 03 
cides with the origin of F(x, y), as iustrated in Fig, 3.14(c). Note the use of 0 1 ， 。 ， ， ， 
Padding for the reasons mentioned in the discussion of Fig. 3.13. Jo perform 0000000090 n000000000 
correjation, we move W(xY, y) in all possible locations So that at least one of 1ts 人 (g) 人 b) 


Pixels overlaps a Pixel in the original image FIxz,y).This 'fulL1 "correlation 1 
Shown in Fig. 3.14(d). To obtain the ' same' correlation shown in Fig. 3.14(eh， 
we Iequire that all excursions of W(x, y) be Such that its center Pixel oveIjaps 
the original 太 xy 

For convolution, we Simply rotate W(x, y) by 180? and proceed in the Same 
manner as in Correlation [Figs.3.14(f) through (hb)].As in the one-dimensional -: 
exarmple discussed earlier，convolution yields the same result regardless of 
which of the two functions undergoes transilation. In _ correlation the order 
does matter, a fact that is made clear in the toolbox by assuming that the flter 
mask is always the function that undergoes translation. Note also the Impor- 
tant fact in Figs.3.14(e) and (hj that the results of spatial correlation and con- 
volution are Totated by 180?” with respect to each other. This，of course, 18 
expected because convolution is nothing more than correlation with a rotated 
fiiter mask. 

The toolbox Implements Zear Spatial fltering using function imfilter， 
whlich has the following syntax: 


Where f ls the input image, wis the filter mask, g is the filtered result, and the 
other parameters are summarized in Table 3.2.The filLltering_mode Specifies 
Whether to filter Using correlation ('corr') of convolution ('conv 小 The 
boundary_options deal with the border-padding jssue, with the size of the 
:border beling determined by the size of the filter. These options are explained 
_ further in Example 3.7. The size_options are either 'same'， or 'fuU1L1' as 
explained in Figs.3.13 and 3.14. 

”JTJhe most common syntax for imfilter is 


g = infilter(f，w， "repLicate ') 


.This Syntax is used when implementing IPT standard linear spatial filters. 
These fters, which are discussed in Section 3.5.1, are Prerotated by 180",so we 
an Use the correlation default in imfilter. From the discussion of Fig. 3.14， 
MA know that performing correlation with a rotated filter is the same as per- 
iorming conyolution with the original filter. If the fllter is symmetric about its 
:enter, then both options produce the same result, 


站 
0 
AN 


G= imfilter(tf，wW，filtering mode，boundary_options，Size _0ptions) 
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TABLCE 3.2 
Options for 
function 
ImfIlLter. 





rot90(W，kK) ro- 
taieyg w py K*90 de- 
BreeS mperek0 an 
ZiEgeF 


EXAMPLE 3.7: 
Using function 
ImfzJl 二 BeP. 


Options 四 Description 


Eiltrering Moae 
CoPr， Filtering 1s done using correlation (See Figs.3.13 and 3.14). This jg 
the default. 
COnV， Filtering is done using convolution (See Figs.3.13 and 3.14). 
Dorxmdary Optio1s 
P The boundaries of the Input Image are extended by padding with a 
value, P (written without quotes). Ihis ls the default, with value 0. 
-replicate'” JTJhesSize of the Image ls extended by replicating the values ln lts 
outer borderT. 
'Symmetric' The size ofthe Image is extended by mirror-reflecting lt across its 
border. 
“Circular， Thbe Size of the image is extended by treating the Image as one 


Period a 2-D periodic functlon. 
1Ze OPIIDHS 


3 The output ls of the Same Size as the extended (Padded) image 
(See Figs.3.13 and 3.14). 
'Same': Thphe output ls of the Same Size as the Input. TIhils ls achieved by 


lmiting the excurslons of the center of the filter mask to polnts 
contained in the original image (See Figs.3.13 and 3.14). This is 
the defaujt， 


When working with filters that are neither Pre-rotated noT Symmetiic, and 
we wish to perform convyolution, we have two options, One ls to use the SyntaX 


9 = infilter(f，WwW， conv ， replicate  ) 
The other approach ls to Preprocess 1W by using the function rot90(w，2) to 


rotate it 180", and then use imfilter(f，w,， replicate'). Of course these 
two steps can be combined Into one statement, TIhe preceding Syntax PIoduces 


an imagegthatis of the same Slze as the input (ie.,the default in computation 


is the .Same' mode discussed earlier). 

Each eiement of thbe fjtered image is computed using double-precision， 
foating-point arlthmetic, However, 1mfilter converts the output image to 
the same class of the input. Therefore,iffls an integer array, then output ele- 
ments that exceed the range of the integer type are truncated, and fractional 
values are rounded. If more precision is desired in the resujt, then f shouid be 
conyverted to class double by using im2doub1le or double before Using 
ImfiIJLter， 


潮 Figure 3.19(aj is a class double image,f,ofsize 912 X 912 Pixels. Consider 
the simple 31 X 31 filter 


>> W= 0nes(31) ; 
















which is proportional to an averaging filter. We did not divide the coefficients 
by (31)” to illustrate at the end of this example the scaling effects of Using 
-imfilter With an image of class uint8. 


”Convolving filter w with an image produces a blurred result. Because the fil- 


teris symmetric, we can use the correlation default in imfilter. Figure 3.15(b) 
Shows the resujt of performing the following filtering operation: 


>> gd = imfilter(f，w); 
人 imshow(gd，[ ]) 


Where we used the default boundary option, which pads the border of the image 
-With 0s (black).As expected the edges between black and white in the filtered 


Image are blurred, but so are the edges between the light parts of the image and 


-the boundary. The reason, of course, is that the padded border js black, We can 
deal with this difficulty by using the 'replicate' option 


>> gr = imfilter(f，W，'repLicate' ); 
2> Tigure，imshow(gr，[ ]) 


As Fig, 3.15(c) Shows, the borders of the filtered image now appear as exX- 
pected. In this case, equivalent resujts are obtained with the 'Symmetric' 
option 


>> gs = ImfiJIter(f，Ww， 'Symmetric ' ) ; 
人 figure，jimshow(gs，[ ]) 
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Die 

de 

FICURE 3.15 

(aj Original image, 
(b) Result of using 
imfilter with 
default zero padding, 
(c) Result with the 
PepLicate'， 
option,(d) Resuit 
with the 
SYnmmetric 
option. (e) Resuit 
With the 'circular' 
option. (了 Resujt of 
converting the 
Original image to 
ciass Uint8 and then 
flterimng with the 
PPepIicate， 
option.A fter of 
Size 31 X 31 with 
all 1s was Used 
throughonit. 
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nm most image processing applications speed is an OverTlding factor，so 
“Colfilt is preferred over nlfilt for implementing generalized nonlinear 
“spatial filterln8g. 

.Given an Input image,f,of size M X N,andaneighborhood ofsize 六 X 7 
function coLfilt generates a matrix, call it A, of maximum size mm xx MAN 二 in 
hich each column corresponds to the pixels encompassed by the nelghbor- 
:ood centered at a jocation in the image. For example, the first column corre- 
ponds to the PIxXels encompassed by the neighborhood when its _ center is 
Jocated at the top, leftmost point in f. Al required padding ls handled trans- 
arently by colfilt (using zero padding). 

Thesyntax of function coLfilt js 


Pigure 3.15(d) shows the result. However, using the'circular' option 


>> gc = imfilter(f，W，'circular '); 
>> figure，imshow(gc，[ ]) 



























produced the resujt in Fig.3.13(ej, which shows the same prIoblem as with zero- 
Padding. TIhis ls as expected because use of periodicity makes the black parts 


of the lmage adqjacent to the light areas. 
Finally we illustrate how the fact that imfilter produces aresult that is of 
the Same class as the input can lead to difficulties not handled properly: 


>> f8 = Im2uint8(f) ; 














>> = imfi : ) ， 了 
加 | 和 ) ; 9g = colfilt(fT，[m n]，'sliding'，efun，parameters) < Milt 
Figure 3.13(f) shows the result of these operations. Here, when the output was、 ae RS 和 0 人 0 ， 
人 人 ， dicates that the process is one of sliding the zz X 27 Tegion from pixel to pixel 

conyerted to the class of the input (uint8) by imfilter, clipping caused: in the input image f, efun references a funetion 证 d bi ， 
some data loss. The reason ls that the coefficients of the mask did not sum { 是 param ers indicates parameters 1 es 
the range [0, 1]j, resulting in filtered values outside the [0, 255] range. Thus',t 和 yb required by function fun | 人 下 了 国生 LN 
avold this difficulty, we have the option of normalizing the coefficients so that MATLAB data type that con 站 人 和 机 3 0 0 
thelir sum is in the range [0,1] (in the present case we would divide the coe 值 AS 2 ee a function. 人 
cients by (31)” so the sum would be 1), or inputting the data in double for Dacatise of 让 SAwaw ii en 。 和 ， ee 0 
mat. Note, however, that even 这 the second option were used, the data usuall 0 eg EAI 了 人 unctlon Tun must OPer- 
would have to be normalized to a yalid image format at some point (e.g., fo ， ， ee 人 

2 alning the results for all the columns.The Kth element of vis the result of the 
0) SR RE SR 1 Ya ldi fthe <ey Point ij bat data Tange Jperation performed by fun on the Kth column of A. Since there can be up to 
have to be kept in mind to avoid unexpected results. 淖 Ms ol mns 训 A, the maximum dimension of vis] 又 17N 
、 ， he lnear filtering discussed ip the Preylous section has provisions for 
3.42 Nonjlinear Spatial Filtering 时 adding to handle the border problems inherent ip Spatial eine When 
Nonjinear spatial filtering is based on neighborhood operations also, and the 间 ， usmg colfilt,however,the input Image must be padded explicitly before fil- 
mechanics of defining ia X 产 neighborhoods by sliding the center point tering. For this we use function padarray, which, for 2-D functions has the 
through an image are the Same as discussed in the previous section. However，: :8Syntax 
whereas linear spatial filtering is based on computing the sum of prIoducts 册 
(which is a linear operation), nonlinear spatial filtering is based, as the name : fp = padarrav1ff | 多 
impjlies, on nonlinear operations involving the pixels of a neighborhood. For ee 2 


Where fis the input image, fp is the padded image, [r c] gives the number of 
TOWS and columns by which to pad f, and method and direction are 3S EX- 
plained In Jable 3.3. For example,iff= [| 2 3 4],the command 


example, letting the response at each center point be equal to the maximum 
Plxel value in its neighborhood is a nonlinear filtering operation. Another 计 
baslc diftference is that the concept of a mask is not as prevalent in nonjlinear :下 
processing. The ldea of filtering carries over, but the “filter”should be visual- 3 二 
lized as anonlinear function that operates on the pixels of a neighborhood, and 
whose Tesponse constitutes the response of the operation at the center pixel of 
the neighborhood. 

The toolbox provides two functions for performing general nonjlinear filter- 
ing: nlfilter and colfilt.The former performs operations directly in 2-D， 
whlle colfilt organizes the data in the form of columns. Although colfilt 
requires more memory,it generally executes significantly faster than nLfilter， 


人 fp = padarray(f，[3 2]，'replicate'， post') 


居 
及 afways has 7 rows, but the number of columns can Yary depending on the size of the input. Size se- 


主 


。 ection is managed automatically by colfilt， 
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TABLE 3.3 
Options for 
function 
padarray. 


正 XAMPELE 3.8: 
Using function 
CoOJlfiJlt to 
Implement a 
nonlinear Spatial 
fjter. 





progdf(A) retrrzasg 共 Be 
Prodrcl of pipe ete- 
HzEHA OA prod 
(Adim) relaras Ime 
Droducl OF ipe 
eierteHis OFAalorg 
dinzemSiOH dinm。 


















0 >> gd = colfilt(f，fm n]，'sliding'，egmean) ; 
 Symmetric” JTJhe size of the image is extended by mirrorreflecting it across its 


border， There are several important points at play here, First note that, although 


-replicate'” TIhe size of the Image is extended by repjlicating the values in its matrix 大 二 part of the argument in function gmean, it is not included in the 

ee ， parametersS in ColLfilt. JIhis matrix 1S Passed automatically to gmean by 
0 | 0 人 colfilit using the function handie. Also, because matrix Als managed auto- 

人 inatically by colfilt,the number of columnsinAls varlable (but, as noted ear- 
lier,the nuUmber of rows, that is, the column length, is always mn), Iherefore, the 
Size of Amust be computed each time the function in the argument ls called by 
和 2 colLlfilt.TIhe filtering prIocess in this case conslsts of computing the product of 
2 和 all pixels in the neighborhood and then Taising the Testult to the power 17 PP 

人 For any value of (x, yj, the filtered result at that point ls contained In the ap- 
propriate column in v.Ihe function ldentitied by the handle,e,can be any func- 
tion callable from where the function handle was created.TIhe key requirement 


Dizrecfzor 
pre' Pad before the first element of each dimension， 








produces the result is that the function operate on the columns of A and return a TOW vector coOn- 

taining the result for all individual columns. Function colfilt then takes those 

fp = : Iesults and rearTanges them to produce the output Image, 9. 珀 
1 2 2 2 

3 4 4 14 ”Some commonly used nonjlinear filters can be implemented in terms of 

3 4 4 4 ”other MATILAB and IPT functions such as imfilter and ordfilt2 (see 

3 4 4 14 Section 3.$.2). Function spfilt in Section 9.3, for example, Implements the 

3 4 4 4 .geometric mean filter in Example 3.8 in terms of imfilter and tbhe MATLAB 


1og and exp functions. When this js possible, performance usually 1 much 


了 directionis not included in the argument,the default is 'both'.Ifmethod :faster and memory usage is a fraction of the memory required by coJTfIilt. 
ls not included,the default padding is with 0's. If neither Parameter is included 清 Eunction colfilt, howeyerI, remains the best cholce for nonlinear flterimg 
in the argument, the default padding is 0 and the default direction is 'both'， 竹 :operations that do not have such alternate implementations. 


Atthe end of computation, the image is cropped back to its original size. 





杜 As an illustration of function colfilt，we implement a nonlinear filter Image Processing Joolbox Standard SPatial Filters 


whose response at any point is the geometric mean of the intensity values of 汪 - 有 this section we discuss jinear and nonlinear Spatial filters SuUPPorted by IE 
the Pixels in the neighborhood centered at that point.The geometric meanin 3 全 Additional nonlinear filters are implemented in Section 5.3， 
neighborhood of size mm X mis the product of the intensity values in the neigh- 旭 
二 。 。 。 。 党 
borhood raised to the power 1/7z1m. First we implement the nonlinear filterT 于 335.1 Linear Spatial Filters 





functlon, call it gmean: 各 . 

泪 JIhe toolbox supports a number of predefined 2-D linear spatial filters, ob- 
function v = gmean(A) tamned by using function fspecial, which generates a filter mask, w, using the 
nn = Size(A，1); % The length of the Columns of A is always mn， 旭 SyntaX 

= prod(A，1) (17Amn)i 第 


了 w = fspecial('type'，parameters) 
Jo reduce border effects, we pad the input image Using， Say， the ' replicate 
option in function padarray: 让。 where 'type， specifies the filter fype, and parameters further define the 

半 specified filter. The spatial filters supported by fspecial are Summarized in 


>> ff = padarray(f，[mnlj，'replicate' ) |; Table 3.4,including applicable parameters for each filter， 





au 
on 
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2 


ee 
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AR ER Syntax and Parameters that 
Spatial flters 、 
supported by 'average' fspecial( average ) [Fr cj ).A rectangujar aveTagIng filter of =fpxz+lLy)+z-1liy)+Fxy+1l)+Fxzy 一 1)] =- 4F(x，,y 
function Sizer x c.The defaultis3 X 3.Asingle number Instead of 
人 人 ee 罗 本 This expression can be implemented at all points (x, y) in an image by con- 
disk' fspecial( disk '， 让 ,全 circular aveTaging filter 《within 3 olving the image with the following spatial mask: 
Square of size 2r + 1) with radius riIhe defaujt radius 1S 5. 
:gaussian' fspecial('gaussian'，[r c]，sS$ig).A Gausslan ljowpass filter 0 1 0 
of sizer x cand standard deviation sig (Positive). The defaults 1 -4 1 
are3 X3and0.39.Asingle number instead of [fr c] Specifies a 0 1 0 





Square fjteT. 

'Iaplacian' fspecial('1Iaplacian'，alpha).A3 X3Laplacian filter whose 
shape is specified by aipha,anumber in the range [0,1].The 
defaujlt value for alpha js 0.9， 








1 lternate definition of the digital second derivatives takes into account di- 
gonal elements, and can be Implemented using the mask 











'10g， fspecial('10g'，[r cl]，sig).Laplacian of a Gaussian (LOG) 1 1 1 
filter of sizer x candstandard deviation sig (positive). The 
defaults are5 X 5 and0.3.Asingle number lnstead of [Fr cj] 上 
1 1 






specifies a Square fiiter. 

前 OtODn fspecial( 'motion'，1len，theta). Outputs afilter that, when 
conyolved with an image, apPITOXimates linear Inotion (of a 
camera with respect to the image) of len Pixels. TIhe direction of 
motion is theta, measured in degrees, counterciockwlse from the 
horizontal. The defaults are 9 and 0, which represents a motion of 
9 Pixels in the horizontal direction， 

Prewitt， fspecial('prewitt').Outputsa3 xx 3Prewitt masKk, wv,that 
approximates a vertical gradient.A mask for the horizontal 
gradient js obtained by transposing the result:wh = WwWV '. 

'Sobel: fspecial( sobel').Outputsa3 X 3Sobel mask, sv,that 
apPIOXimates a vertical gradient,A mask for the horizontali 
gradient is obtained by transposing the resujt: sh = SVv '. 

unsharp' fspecial('unsharp'，alpha).Outputsa3 X 3unsharp filter. 
Parameter alpha controls the shape; it must be greater than 0 and 
less than or equal to 1.0; the default is 0.2. 










oth derivatives Sometimes are defined with the signs opposite to those shown 
ere, Tesulting in masks that are the negatives of the preceding two masks. 
Enhancement using the 二 aplaclian 1s based on the equation 







ge 力 一 7 妨 二 c[V FOOD] 








here FLz, yj) is the input image,g(x, y) is the enhancedimage,andcisTifthe 
enter coefficient of the mask is positive, of -Tifitis negative (Gonzalez and 
Voods [2002]). Because the Laplacian is a derivative operator, it sharpens the 
age but drives constant areas to zero. Adding the original image back re- 
Stores the gray-level tonality， 

-上 unction fspecial( Japlacian'， alpha) Implements a more general 
Laplacian mask: 


















































EXAMPLE 3.9: 炙 We illustrate the use of fspecial and imfilter by enhancing an image : ] - 下 一 ] 、 
Using function with a Laplacian filter. The Laplacian of an image F(x, ) 门 , denoted VFUz 让 ， 本 人 
0 is defined as 0 
， ae 人 素 汪 ] 十 和 0 1 十 aa 
人 
风 上 和 洁 过 下 池 本 汪 寺 过 
Commonjly used digital apPproximations of the second derivatives are 人 
32/ Which allows fine tuning of enhancement results. However the predominant 
-= jz+Tl 人 +Fz-TLy) 一 20 :use of the Laplacian is based on the two masks just discussed. 
0X 2 We now proceed to enhance the Image in Fig. 3.16(a) using the Laplacian. 
人 This image is a mildly blurred image of the North Pole of the moon. En- 
hancement In this case consists of sharpening the image, while preserving 8sS 
9 -much of its gray tonality as possible. First，we generate and display the 


< 过 se 
5 帮 x 十 二 十 7 一 也 一 270cy) Laplacian filter: 
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3 

所 : 区 : 

FIGURE 3.16 

(a) Image of the 
North Pole of the 
moon， 

(b) LaPjacian 
filtered 1mage， 
Using Uint8 
formats，. 

(c) Laplaclian 
filtered Image 
obtained Using 
double formats， 
(d) Enhanced 
Tesult, obtained 
by subtracting (C) 
from (3a). 
(Original image 
COUrtesy of 


NASA 
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Next we appiy wto the Input image, f, which ls of class Uint8: 


g1 = imfilter(fT，w， replicate ) ; 
“imshow(g1，[ ]) 


igure 3.16(b) shows the resulting image. This result looks reasonable, but has 
:problemn: all its PIXejls are Positive. Because of the negative center 人 filter coef- 
jcient, we Know that we can expect ln general to have aLaplacian image with 
ee values. However， f 1 tb1s case 1 CO cjass Uinta and， as discussed i n 


SR im2doub1le( 了 f) ; 
2 :g2 = lmfilter(f2，W， replicate ) j 
:Inshow(g2，[ ]) 





了 久 AMPLE 3.10: 
Manually 
specifying filters 
and comparing 
enhancement 
technligques. 







e ls to implement this filter manmually and also to cotmpare the resSujts ob- 
ined by using the two Laplacian formulations. The sequence of commands is 
>> W = fspecial('1aplacian'，0) follows: 
册 关 
f = imread( 'moon.tif' ) ; 

w4 = fspecial('`1aplacian' ，0); 点 Sane as Win Examnple 3.9， 
[于 

f = im2double(f) | 

g4 = 二 一 imfilterff，w4， "replicate' ) |; 

98 = 了 一 infiIter(fT，w8， " replicate ) | 

Imshowf(Tf) 

figure，imshow(g4) 

figure，imshow(g8) 


0.0000 41.0000 0.0000 
1.0000 -4.0000 ”1.0000 
0.0000 1.0000 0.0000 


宝 
Oo 
由 


Note thatthe filteris of class double,and that its shape with alpha = 01 也 
Laplacian filter discussed previously. We could just as easily have specified thls 
shape manually as 


>>w= 010;1-41) 010]; 
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: 生 : 
人 
FGURE 3.17 (3) 
Image of the North 
Pole of the moon， 
(b) Image 
enhanced using the 
Laplacian 

fjter 'Iaplacian '， 
whlach bas a 一 4 in 
the center. (c) 
JImage enhanced 
Using aaplaclan 
fijlter with a 一 8 in 
the center. 
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Tanking resujlt.Attention is focused in this section on nonjlinear filters generat- 
edbyordfilt2.A number of additional nonlinear filters are developed and 
implemented in Section 5.3. 

-The syntax of function ordfilt2 is 





g=ordfilt2(f，ocorder，domain) 

This function creates the output image g by replacing each element of f by the 
order-th eljement in the sorted set of neighbors specified by the nonzero ele- 
Iments in domain. Here, domainis an ma X Iatrix of ls and 0s that specify the 
pixel locations in the neighborhood that are to be used in the computation. In 
this sense, domain acts like a mask.TIhe pixels in the neighborhood that corre- 
spondto0inthe domain mnatrix are not used in the computation. For example， 
toimplement a 127 jier (order 1T) of Size 1 X 灵 Weuse the syntax 

g= ordfilt2(f，1f，ones(m，n)) 

mm this formujlation the 1 denotes the lst sample in the ordered set of jz Sam- 
ples and ones (m, n) creates an 1 X mmatrix of lsindicating that all Samples 
in the nelghborhood are to be used in the compnutation. 

Jn the terminology of statistics, a min filter (the first sample of an ordered 
set) js Teferred to as the 0th percentile. Similarly, the 100th percentile is the last 
Sample in the ordered set, which is the mapth sample.This corresponds to a maatx 
ztem which is Implemented using the Syntax 


g= ordfilit2(f，mx*n，ones(m，n) ) 

The best-known orderstatistic filter ip diglital Image Processing is the 
Jiediaj' Jiter which corresponds to the S0th percenittlle. We can USe MATLAB 
Unctlon medianin ordfilt2tocreate amedian filter: 





9 = ordfilt2(f，median(1:mxsn)，ones(m，n)) 






:Where medlant1:msn) Sinply computes the median of the ordered sequence 
5 ,7 Function median has the general syntax 


Figure 3.17(a) Shows the original moon image again for easy comparison， 
Fig. 3.17(b) ls g4, which ls the same as Fig. 3.16(dj, and Fig. 3.17(c) shows g8. 


As expected, thls resujlt is Significantiy sharper than Fig,. 3.17(b). 蜂 : V = 


median(A，dinmn) 

Where V 1l$ vector whose elements are the median of A along dimension dinm. 
orexample, 过 dim = 1,each element of vis the median ofthe elements along 
the corresponding columan of A. 


43. Nonjlineatr Spatial Filters 


A commonly-used tool for generating nonlinear Spatial filters in IPT is func- 
tion ordfilt2, which generates order-statistic Fltery (also called ramK Hter)， 
These are nonlinear Spatial filters whose response is based on ordering (Tank-. 
ing) the pixels contained in an image neighborhood and then replacing the.: 
value of the center pixel in the nelighborhood with the value determined by the 


: 
ecall that the median,e, of a set of values is such that half the values in the set are less than or equaj 
-0 名 and half are greater than or equal to 世 


吕 
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人 人 


xpdfIlt2 





2 
pie 全 了 有 的 和 ， 证 
edian 
人 志 
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:edfilt2 


玉 AMPLE 3.11: 
Median filtering 
with function 
medfilt2. 
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Because of its practical importance, the toolbox provides a specialized im- 
Plementation of the 2-D median filter': 
g = medfilt2(f，fm nl，padopt) 
where the tuple [Im n] defines a neighborhood of size m x_n ovelr which the 
median is computed，and padopt 0 one of three possible border 
padding options: ' zeros' (the default)，'symmetric' in which f is extended 
symmetrlcally by miIITOrTeflecting lt across its border, and :indexed'， in 


which fis padded with 1s ifit is of class double and with 0s otherwise. The de- 
fault form of this function is 


g =medfilt2(f) 


which usesa3 X 3neighborhood to compute the median, and pads the border 
of the input with 0s， 


疆 Median filtering is a useful tool for reducing salt-and- -pepper nolse in an 
image.Ajthough we discuss noise reduction in much more detail in Chapter 5， 
tt will be instructive at this point to illustrate briefly the implementation of 
Imedlan Htering， E 
The image in Fig. 3.18(a) js an X-ray image,f,of an industrial circuit board: 
taken during automated inspection of the board. Figure 3 18(bj is the same- 
Image cofrupted by salt-and-pepper noise in which both the black and white 
polints have a Probability of occurrence of 0.2.This image was generated USing 
function imnoise, which is discussed in detail in Section 5.2.1: 
>> fn = imnoise(f， "Salt & pepper'，0.2); 
Figure 3.18(c) is the result of median filtering this noisy image， Using the 
Statement: 在 





>> gm = medfilt2(fn); 年 


Considering the level of noise in Fig， 3.18(b)j, median filtering Using the de- 
faujlt settings did a good job of noise reduction. Note, however, the black 





the lmage (recall that the default pads the border with 0s). This type of effect 
can often be reduced by using the 'symmetric' option: 
>> gmns = medflIlt2(fn， 'Symmetric ' ) ; 


The result shown in Fig.3.18(d)j,is close to the result in Fig.3.18(c)， eXceptthat . 
the black border effect is not as pIonounced. 呈 





m addition to dealing with image enhancement， the material in this chapter ls the foun- 
“dation for numerous topics in Subsequent chapters, For example, we will encounter Spa- 
.tial processing again in Chapter $ in connection with image restoration, where we also 
take a closer look at noise reduction and noise-generating functions in MATLAB. 
”9ome of the spatial masks that were mentioned briefly here are used extensively in 


Chapter 10 for edge detection in segmentation a plications. The concept of convolu- 
specks around the border, These were caused by the black points surtounding 渤 


tion and correlation is explalined agaln in Chapter 4 from the perspective oft the fre- 
-quency domain. Conceptually, mask processing and the Implementation of spatlal 
fters will surface in various discussions throughout the book. In the process, we will 
extend the discussion be gun here and introduce additional aspects of how spatial fijters 
can be implemented efficiently in MATLAB. 


温 Summary 
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全 6 


FIGURE 3.18 
Median filtering， 
(a) X-ray image. 
(bj) Imasge 
coOITUpted by Salt- 
and-pepPer nolse. 
(c) Result of 
median filtering 
with medfilt2 
Using the default 
Settings. 

(d) Resulit of 
niedian fltering 
using the 
'SYmmetrrc 
Image extension 
option. Note the 
ImPprovement in 
border behavioT 
between (d) and 
(c). (Original 
Image coOuUrteSy 
of LIXl Inc.) 
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coordinate System Spanned by Et V) with zu and yw as (frequency) variables， 
This is analogous to the Spatial dormain studied in the previous chapter, which 
is the coordinate System spanned by F(x, )), with x and y as (Spatial) variables. 
The M X AN rectangular region defined by xx =1012....M-1 and 
攻 = 0,12,N 一 Tisoftenreferred to as the 廊 eqguercy rectaagle. Clearly 
the frequency rectangle ls of the same size as the input image. 

.The inyerse, discrete Fourier transform is given by 


Ti 


LDO777077T17 


TV 
FUx,y) = 一 一 >， >， 尼 (z， 有 2 
LN 二 一 0 了 7 一 人 


orz 一 012...，M- 一 landy=012...,N-- 1.Ihus, given 天 (2 2)，We 
an obtain 帮 x, y) back by means ofthe invyerse DFTThevalues of F(zx, 2)inthis 
quation Sometimes are referred to as the FDrxrier coe 护 cienkg of the expansion， 
”In some formulations of the DFT, the 1/MN term is placed in front of the 
ansform and in others it js used in front of the inyerse. To be consistent with 
AILAB s Implementation of the Fourier transform, we assume throughonut 
e book that the term is in front of the inverse, as Shown in the preceding 
equation. 电 ecause array indices in MATILAB start at 1,rather than 0,F(1，1) 
:andf(1，1) in MAILAB correspond to the mathematical quantities 下 (0, 0) 
:and j(0,0) in the transform and its inverse. 

-The yalue of the transform at the origin of the frequency domain [ie,， 
(0,0)] is called the dc component of the Fourier transform.This terIninolopgy 
-from electrical engineering, where“dc” signijfies direct current (current of 
Zero frequency).Itisnot difficujt to show that F(0,0) isequalto MNtimes the 
ayerage Value of F( xz， y)， 

Even 让/x, y) isreabits transform in generalis complex.The PrIincipal method 
ef visually analyzing a transform is to compnute its spectru7 [ethe magnitude of 
下 (2 and display it as an image.Letting Rtz o) and 7 (w, 7) Tepresent the real 
andimaginary components of E(z w),the Fourier spectrum is defined as 


FE( 人 (eol = [RD + Po 


PyeziezU 


For the most part, this chapter parallels the filtering topics discussed in Chapter 3， 
but with all filtering carried out in the frequency domain via the Fourier trans- 
form. In addition to being a corerstone of jinear 人 tering, the Fourier transform 
offers considerable flexibility in the design and implementation of filtering solu- 
tions In areas Such as image enhancement, image restoration, imapge data com- 
Presslon, and a host of other applications of practical interest. In this chapter, the 
focus ls on the foundation of how to perform frequency domain filtering in MAT- 
LAB.As in Chapter 3, we ilustrate filtering in the frequency domain with exam- 
ples of Image enhancement, including lowpass 人 itering, basic highpass filtering， 
and hlgh-frequency emphasis filtering. We also show briefly how spatial and fre- 
quency domaalin Processing can be Used in combination to yield results that are Su- 
perior to using either type of processing alone. The concepts and techniques 

developed in the following sections are quite general, as is amply ilustrated by 浊 
other applications of this material in Chapters $,8,and IT1. 3 





















JIhe Phnase angle of the transform is defined as 


The 2-D Discrete Fourier Transform 


Let Fr y),forz=012,M-1landy=012.,N-1ldenotean | 
M Xx AN Image. TIhe 2-D, discrete Fourier transjform (DFT) of 放 denoted by 洁 
EU,is given by the equation 


-The preceding two functions can be used to represent F(z,v) in the familiar 
“Polar representation of a complex quantity: 


(WuV) = |F(u， 四)|e- 语 (ee 沁 


M -LA 一 1 


.The 二 
FU 2) = Fe， 站 ETPr(orM+eN) 汪 POWer Spectrum ls defined as the square of the magnitude: 


Po) = (ez 

尽 (VD 十 (oo 

-For purposes of visuajization it typically is immaterial whether we view 
IE(ao)lor P(e 四 )， 


有 


for2=01l2. .MUM 一 1and27=012...N--1 Wecould expand the 
exponential into sines and cosines with the variablesw and y determining their 
frequencies (Y and yare Summed out). The 广 eqguency dormain is simply the 


4.1 里 The 2-D Discrete Fourier Transform 
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FIGURE 4.1 

(a) FourierT 
Spectrum showing 
back-to-back half 
periods in the 
interval 

[0, M -1 . 

(b) Centered 
Spectrum in the 
same interval， 
obtalned by 
multiplying 帮 x) 
by (一切 Prior to 
computing the 
Fourier 
transform， 


If FUxz, y) is real, its Fourier transform is conjugate SymInetric about th 
Origin; that 1S， 


F(zo) = 已 (一 心 一 7 
which implies that the Fourier Spectrum also is Symmetric about the orlgin: 
Fe 2 = 人 
It can be Shown by direct substitution into the equation for 三 (su) that 
Fa =EU+AMO)=Flu+TN)=PUT+AMMV+AN 


JIn other words, the DFT is infinitely periodic in both the x and vw directions 
with the periodicity determined by WM and N. Periodicity is also a Property o 
the Inverse DEFT 


Flx 站 =jrz+My)=Aoy+N=FT+MyTAN 


That is, an image obtained by taking the inverse Fourier transform is also inf 芋 
nitely periodic. This is a frequent Source of confuslon because it ls not at alj in: 
tujtive that images resulting from taking the inverse FouIier transtorm Shou 
turn out to be Periodic. 工 hejlps to remember that this is simply a mathematica 
property of the DFT and its inverse.Keep in mind also tbat DET impiementa 
tions compute only one Period, So we work with arrays of Size M x A. 
The periodicity issue becomes important when we consider Aow DFI datar 
late to the periods of the transform., For instance, Fig, 4.1(a) Shows the Spectrum 
of a one-dimensional transform, FE(z). In this case, the periodicity expression be 
comes F(i = FU + M) ,from which it follows that |F(o)| = |jE(e + M 
also, because of symmetry |E(o| = |E( 一 中 |.The periodicity property indicates 
that E() has a period of length Adf,and the Symrmetry PrOPerty indicates that the 
magnitude of the transform is centered on the origin, as Fig,4.1(a) shows. Thils fig 
ure and the preceding comments demonstrate that the magnitudes of the trans: 
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4.1 各 The 2-D Discrete Fourier Iransforma 


ivalues from AM/2toM -larerepetitions of thevaluesin thehalfperiod to 
eft of the origin. Because the 1]-D DFT is impiemented for only M points 
forvalues of xin the interval |0,M -1j),itfolowsthat computing the 1-D 
insform yields two back-to-back half periods in this interval. We are interested 
obtaining one full,properpy ordered period in the interval [0, AM 一 j.Itisnot 
difficult to show (Gonzajlez and Woods [2002]) that the desired period js obtained 
multiplying 了 (X) by 全 1) PrIior to computing the transform. Basicajlliy what 
hisdoes js move the origin of te tra1asjorza to thbhe point UL 三 141]2,asEFig.4.1(b) 
DOwWs. Now, the value of the spectrum at = 0 in 下 g. 4.1(b) corresponds to 
-M12)| in 再 g. 4.1(a). Similarly, the values at 区 M/2)| and |F(CAM - 1T)| in 
4 1(b) correspond to |F(0)| and |E(M/2 - 1)in Fig.4.1(a). 
轴 similar Situation exists with two-dimnensional functions. Computimng the 2-D 
和 nowyields transform points in the rectangular interval shown in Fig. 4.2(a)， 
Where the shaded area indicates values of P(z%,v) obtained by impiementing the 
) Fourier transform equation defined at the beginning of thjs section. The 
ed Iectangles are periodic Tepetitions, as im Fig. 4.1(a). JIhe Shaded Iegion 
sthat the values of FL V) noOw encompass four back-to-back quarter Peri- 
ds that meet at the point shown in Fig. 4.2(a).Visual analysis of the spectrum is 
implified by moving the values at the origin of the transform to the center of the 
quency Iectangle.This can be accomplished by multiplying F(xz, y) by (一 1) 7 
iior to computing the 2-D Fourier transform. The periods then would align as 
iown in Fig.4.2(b).As in the previous discussion for 1-D functions, the vaiue of 
Spectrum at (AMW1/2, N/2) iFig. 4.2(b) is the same as its value at (0,0) in 
ig. 4.2(a),， and the value at (0,0) in Fig. 4.2(b) js the Same as the value at 
2, -N/2) in Fig. 4.2(a). Similarly the value at (M 一 AN 一 1 am 
ieg:4.2(bj js the Same as the value at (M/2 一 LN/2 一 1)inFg.42(a). 





| 1 
| | 
NA 入 一] | 





| Four back-to-back | 
perlods meet here， 





pe 


枉 = Periods of the 2-D DFT 


= MXANdataarrayTresSulting from 
the computation of 天 (ZL, 7). 


cuR 4.2 (a) M xx N Fourier Spectrum (shaded)，showing four back-to-back quarter 
eriods contained in the spectrum data. (b) Spectrum obtained by multiplying Fx, y) by 
0 Prior to computing the Fourier transform. Only one period is shown shaded because 
1 ithe data that would be obtained by an n implementation of the equation for 下 《4 二 


二 
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The preceding discussion for centering the transform by multiplying /xz 妨 
by (-1” is an important concept that is included here for completeness. 
When working in MATLAB,the appProach is to compnute the transform without 
multiplication by (-1) yand then to rearrange the data afterwards using func: 
tion fftshift.This function and its use are discussed in the following sectione 


Computing and Visualizing the 2-D DFI In MAILAB 


The DEFT and its inverse are obtained in practice Using a fast Fourler trans- 
form (FFT) aigorithm.The FFET ofan M X Nimage array fis obtained in the: 
toolbox with function fft2, whblch has the Slimple Syntax: 


F = fft2(f) 


This function retulins a Fourier transform that 1s also of Size AM X N, with th 
data arranged in the form shown in Fig. 4.2(aj; that is，with the origin 0 
the data at the top left, and with four quarter Perlods meeting at the cente 
of the frequency rectangjle. 

As explained in 9ection 4.3.1, it ls necessary to pad the Input Image with ZerIo 
when the Fourier transform is used for fllterimng. In thls case, the Syntax become 


F = fft2(f，P，Q) 


With thls Syntax, fft2 pads the input with the requlred number of zeros S 
that the resultng function ls ot Size 已 X 必 ， 
The Fourier Spectrum 1s obtalined by using function abs: 


S = abs(F) 


whbich computes the magnitude (square root of the sum of the squares of 也 
real and Imaginary parts) of each element of the array. 

Visual analysls of the spectrum by dispijaying it as an image ls an Importan 
aspect of working in the frequency domaln. As an liustration，,consider 岂 
simple image, f, in Fig. 4.3(a). We compute its Fourier transfornm and dispjay 
the Spectrum using the tollowing Sequence of steps: 


Where F js the transtorm computed using fft2 and Fc is the centered trans- 
lomm. Function fftshift operates by Swapping quadrants of F. For example, 让 
a= [12; 34],fftshift(a) = f4 3; 2 1].When applied to a transform 





atter it has been computed, the net result of uslng fftshift is the same as 让 
the Imput image had been multiplied by (一 1)3+*y7 prior to computing the trans- 
form. Note, however, that the two processes are 1aot interchangeable. That is， 
jetting S['] denote the Fourier transform of the argument，we have that 


SL(1D) 7 让)] is equal to fftshift(fft2(f)), but this quantity is not 
equal to fft2(fftshift(f) )， 
了 the Present example, typing 


>> F = fft2(f); 
>> S3= abs(F); 
>> imshow(S，[ ]) 


Ce 





Figure 4.3(b) shows the result. The four bright spots in the corners of the 
image are due to the periodicity Property mentioned in the previous section 

IPT function fftshift can be used to move the origin of the transforrm to 
the center of the frequency rectangle.Jhe syntax ls 


S fc = fftshift(F); 
imshow(abs(Fc)，[ ]) 


yielded the image in Fig. 4.3(c), The result of centering is evident in this 


Fc = fftshift(F) mage 
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:3 已 

FIGURE 4.3 

(ajA simple image. 
(b) FourierT 
spectrum. 

(c) Centered 
Spectrum. 

(d) Spectrum 
visually enhanced 
byalog 
transformation. 
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Although the shift was accormpPjlished as expected,the dynamic range of hb 
vajlues in this spectrum is So large (0 to 204000) compared to the 8 blts of the: 
display that the bright values in the center dominate the resuit.As discussed in 
Section 3.2.2, this difficulty is handled via a log transformation. Thus，thei 
cormmands 


fom found-off errors that are characteristic of foating Point computations. 


ons can be comblined: 


sf = realf(ifft2(F) ) 


>> S2 = 10g(1 + abs(Fc) ) ; 
>> imshow(S2，f ]) 


resulted in Fig. 4.3(dj.The increase in visual detail js evldent ln thls Image. Which pads F with zeros so that its size is 已 X CQ before computing the inverse. 
Function ifftshift ITeveIses the centering, Its SyntaX 1S 罗 This option is not used in the book. 


F = ifftshift(Fc) 


This function can be used also to convert a function that is initially centered on 时 
a rectangle to a function whose center is at the top, jett corner of the rectan8gl 
We make use of this property in Section 4.4， 

While on the subject of centering, keep in mind that the center of the 人 
quency rectangle js at (af1/2, N/2)iithe yariablesxandvrunfromg0to M 一 
and N -1,respectively For example,the centerofan 8 xx 8frequency Squar 
is at point (4.4), which is the Sth point along each axis counting up from (0， 
If as in MATLAB,the variables run from l to M and l to AN,respectively, the 
the center of the square js at [(M/2) + 1 (NA/2) + 1 Ithe case of at 
8 X 8 example, the center would be at point ($, 3), counting up from 《11 
Obviously, the two centers are the same point, but this can be a SOuUrce of co 
fusion when deciding how to specify the location of DFT centers in MATLA 
coOmputatlons. 

If M and N are odd,the center for MATLAB computations ls obtained b 
rounding AM1/2 and N/2 down to the closest integer. The rest of the analysls 
as in the previous paragraph. For exampie, the center of a 7 X 7 region 1S 
(3,3) 让 we count up from (0,0) and at (4,4) 填 we count up from (1, 1).Ine 
ther case, the center is the fourth point from the origin. Ionly one of the d 
mensions is odd, the center aliong that dimension is simjlarly obtained b 
rounding down in the manner just explained. Using MATLAB's functlo 
floor, and keeping in mind that the origin is at (1, 1), the center of the 红 
quency rectangle for MATLAB computations ls at 


e give 3 brief overIview of the concepts involved in frequency domain fjiter- 
ing and its implementation in MATLAB. 


4. Fundamental Concepts 


The foundation for jinear filtering in both the spatial and frequency domalns 1S 
he convolution theorem, which may be wrIitten as 


xy)*PPy) 人 瓦 (VE TD) 
| conversely， 
FF yy 全 万 (VD)*C( TD 


ere, the Symbol“*”imdicates convolution of the two functions, and the ex- 
fesSsions on the sides of the double arrow constitute a Fourier transform palr. 
or example, the first expression indicates that convolution of two spatlal 
inctions can be obtained by computing the inverIse Fourier transform of the 
Toduct of the Fourier transformas of the two functions. Conversely, the for- 
ard Fourier transform of the conyolution of two spatial functions gives the 
roduct of the transforms of the two functions, Similar comments apply to the 
econd expression. 

jn terms of filtering, we are interested in the first of the tfwo Previous exX- 
pressions，Filtering in the spatial domain consists of convolving an image 
(xz, y) with afilter mask,A(x, y. Linear spatial convolution is precisely as ex- 
ined in Section 3.4.1.According to the convolution theorem, we can obtain 
Same result in the frequency domain by multiplying 玉 (2V) by 五 (4 站 )， 
eEFourier transform of the spatial filter. It is customary to refer to 殖 ( 引 2) as 
he iiter transjer 记 nctio7. 

人 Basically the idea in frequency domain filtering is to select a filter transfer 
function that modifies F(o ) in aspecified manner,For example,the filter In 


ffloor(M/2) + 1，TflLoor(N/2) + 1] 


The center given by this expression is valid both for odd and even values of 呆 : 


and A， 
Finally we point onut that the inverse Fourier transform is computed usin 
function ifft2. which has the baslc SyntaxX 


f = ifft2(F) 





where F is the Fourier transform and f ls the resulting Image. Ifthe Input uSe 
to compnute F is real, the inverse in theory should be real. In practlce, howeye 


or digital images, these expressions are strictly valid only when /fxe.y) and An, y) have been proper- 
padded with zeros. as discussed later in this section。 





output of ifft2 often has very small Imaglinary comPponents ITesuiting 


Thus, it is good practice to extract the real part of the resujt after computing 
be jinverse to obtain an Image conslsting only of real Values. The two opera- 


4.3 遇 Filtering in the Frequency Domain 工 5 





real(arg) ard 
imag(arg) errracr 
ie real ad LIG8L- 
Mary Pr OF arg， 
resPpectively. 
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二 

FIGURE 4.4 
Transfer functions 
of (a) a centered 
Ilowpass filter, and 
(b) the format 
Used for DFT 
filtering. Note 
that these arTe 
frequency domain 
flters. 
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The following function, called paddedsize, computes the minimum even' 
ualues of P and C requked to Satisfy the Preceding equations. It also has an 
option tO pad the inputs to form square images of size equal to the nearest in- 
eger POwWer of 2. 下 xecution time of FFT algorithms depends roughly on the 
mumber of prime factors in 已 and C. TIhese algorithms generally are faster 
When 已 and C are powers of 2 than when Pand CO are prime. In practice, it js 
idvisable to work with square images and filters so that filtering js the same in 
both directions. Function paddedsize provides the flexibility to do this via the 
ioice of the input Parameters. 

了 function paddedsize,the vectors AB, CD, and PQ have elements [A B]， 
D],and {fP Qj,respectively where these quantities are as defined above. 





Fig. 4.4(aj has a transfer function that, when multiplied by a centered 天 (以 ， y) 
attenuates the high-frequency components of Eu while leaving the low 
frequencies relatively unchanged, Filters with this characteristic are called 
lowpass Jillers. As discussed in Section 4.5.2, the net result of 1owpass filtering 
is image blurring (Smoothing). Figure 4.4(b) shows the same filter after it wase 
PTrocessed with fftshift.This is the filter format used most frequently in the 
book when dealing with frequency domain filtering in which the Fourier trans: 
form of the input is not centered. ; 

Based on the convolution theorem, we know that to obtain the correspond: 
ingfltered Image in the spatial domain we Simply compute the inverse Fourier 
transform of the product 互 ( VE Vi.Itis important to keep in mind that 
the Process just described is identical to what we would obtain by using conyo- 
lution in the Spatial domain, as long as the flter mask, P(X, y)， is the inverse 
Fourier transform of 束 (w, 7). In practice, spatial convolution generally is sim- 
plified by using Small masks that attempt to capture the salient features of 
thelr frequency domaln counterparts. 

As noted in Section 4.1， images and their transforms are automatical 
considered periodic 寺 we elect to work with DFTs to implement filtering.It 
not difficujt to visuallize that conyolving periodic functions can cause interfe 
ence between adjacent peiiods if the Periods are close with resPect to the d 
ration of the nonzero parts of the functions. This interference，called 
Ww7aDparoPa erro can be avoided by padding the functions with zeros, in 也 
following mannerT. 

Assume that functions FUx, y) and mn(x,y) areofsize 4Xx 了 BandCxXxDD 
TeSpectively. We form two extemaded (padded) functions,both of size 尼 XOb 
appending zeros to 厂 and 8. It can be Shown that wraparound error is avolide 


nction PQ = paddedsize(AB，CD，PARAM) paddedsize 
PADDEDSIZE Computes padded Sizes Useful for FFT-based filtering. ee 
“ PQ = PADDEDSIZE{AB) ，where AB is a two-element Size vector， 

Computes the two-element Size vector PQ = 2xAB， 

-. PQ = PADDEDSIZE{(AB， " PWR2 ) Computes the vector PQ such that 

_PQ(1) = PQ(2) = 2^nextpow2(2*m)，where m is MAX(AB) ， 


_PQ = PADDEDSIZE(AB，CD)，where AB and CD are two-element size 
Vectors，Computes the two-element Size Vector PQ， The elementSs 
of PQare the smallest even integers greater than or equal to 
-AB + CD ~ 1. 


_PQ = PADDEDSIZE{AB，CD，'PWR2') computes the vector PQ such that 
Pa 人 1) = PQt2) = 2^nextpow2{2xm)，where m Is MAX([AB CD]j ) . 


lseif nargin == 2 & ~ischarfCD) 
PRQ=AB+CD-1; 

FQ .= 2 * 6il(PQ 广 g 邹 ? 

Seif nargin == 

mi = max(AB); 名 Maximum dimension， 


和 Find power-of-2 at 1east twice 由 ， 0 
“= 22nextpow2(2x*m); TBX 
[5 让 | 
f nargin == 3 


-了 





P = nextpow2fn) 


by choosing max([AB CD]); 先 Maximum dimension. Feftlmag 雪 e saleyt 
= nextpow2(2x*m); pteger power oF 2 
玉 过 : 直 囊 尼 寺 QQ = [P，P]; jaris8gredter pa Or 
edGHUal IO Ine db8yoUte 
error( Wrong number of inputs.') 0 
本 -一 


Most of the work in this chapter deals with functions of the same size, M X 
in Which case we use the folowing padding values: 已 2M -1 and 
忆 二 2AN 一 1 工 . 





”Sustomary to work with arrays of even dimensions to speed-up FFT computations， 
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With PQ thus computed using function paddedsize, we use the followin 
Syntax for fft2 to compnute the FFT using zero padding: 


F = fft2(f，PQ(1)，PQ(2)) 


JTJhbls syntax Simply appends enough zeros to f such that the resulting image 
of Size PQ(1)xPQ(2), and then computes the FFT as previously described 
Note that when using padding the 8lter function in the frequency domain mt 
be of size PQ(1) xPQ(2) also. 


也 及 AMPLE 4.1: 
下 itects of fitering 
with and without 
Padding. 


性 JIJhe image, f, in Fig. 4.5(a) is used in this exampjie to illustrate the differ 
ence between fltering with and without padding. In the following discussioi 
we use functlon lpfilter to generate a Gaussian ijowpass filters |similar 
Fig. 4.4(b)] with a specified value of sigma (sig). This function is discusseda 
detail in Section 4.5.2, but the syntax is straightforward, so we use it here aa 
defer further explanation of Lpfilter to that section. 

The toliowing commands perform fiitering without padding: 


>> [M，NJ = Size(f); 
>> FF = 于 ft2( 下 ); 


>> Sig = TO0; 

>> = JpfiLter( gaussian ，M，N，Sig) ; 
>>G= H.*F; 

>> 9 = Feal(iIifft2(G) ) |; 


>> imshow(g，[ ]) 


CN 














YIng， jhey are not part of the data.The dashed lines are used to designate 
RDitiarily) the AM X N image processed by fft2. Imagine convolving a blur- 
blter with this infinite periodic sequence. It is clear that when the fiiter 1S 
43SIng through the top of the dashed image it will encompass part of the 
age ltself and also the bottom part of the periodic component right above it 
us when a jight and a dark region reside under the filter, the result will be 
mid-gray， blurred output, This is Precisely what the top of the Image In 





a bc 
FIGURE 4.5 (a) A simple image of size 256 X 256. (b) image 1owpass-filtered in the frequency domain w 
out padding.{c) Image lowpass-filtered in the frequency domain with padding. Compare the light portio: 
the vertical edges in (bj and (c)， 
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FICURE 4.6 

(a) Implied， 
inftinlte Perlodic 
sequence of the 
Image in 
Fig.4.3(a). The 
dashed reglon 
represents the 
data processed by 
fft2. (bj The 
same Periodic 
sequence after 
padding with 0s. 
The thin white 
lines in both 
Images are Show 
for convenience 
in viewing; they 
are not Part 

of the data， 
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FIGURE 4.7 Fuli 
padded image 
Tesujlting from 
ifft2 after 
bjtering. Thls 
Image ls of size 


512 X 912 Pixels. 
































ecall fom wection 3.4.1 that this call to function imfilter pads the border 


Fig.4.3(b) shows. On the other hand, when the filter is on the light sides of th 
f tb image with 0s by default. 后 


dashed image,lt will encounter an identical region on the periodic component 
Since the average of a constant Tegion js the same constant, there is no blur 
ring in thls part of the resujlt. Dther parts of the image in Fig. 4.S(b) are ex 
Plained im a similar mannerT. 

Conslder now fijtering with Padding: 





:2 Basic teps in DFT Fijltering 


2 discussion ln the previous Section can be Summarized in the following 
p-by-step procedure Involving MATILAB functions, where f is the image to 


ae is of 
>> PQ = paddedsize(size(f))， fltered, g 1lS the result,and lt is assumed that the filter function 豆 (x, vis o 


>> Fp = fft2(f，PQ(1)， 0 % Compute the FFT with padding he same SiZe as the padded image: 

>> Hp = 1LIpfilter('gaussjian' ，PQ(1)，PQ(2)，2*sig)， 
>> Gp = Hp.*Fp， _Obtain the padding parameters using function paddedsize: 
>> gp = paddedsize(Size(f)); 


real(Ifft2(Gp) ) ; 
>> gpc = gp(1:Size(f,1)，1:Size(f,2))， 


>> imshow(gp,，[ ]) 2. Obtain the Fourier transform with padding: 


= fft2(f，PQ(T)，PQ(2) ); 


where we used 2*sig because the filter size is now twice the size of the fter _Generate a filter function, H, of Size PQ(1) x PQ(2) using any of the 
used without padding. Imethods discussed in the remainder of this chapter. The filter must be in 
Figure 4.7 shows the full,padded result, gp. The final resujlt in Fig.4.5(c) was _ the format shown in Fig. 4.4(b).Ifit is centered instead, as in Fig. 4.4(a)， 
obtained by cropping Fig. 4.7 to the original image size (See the next-to-lasf letH = fftshift(H) betore using the filter. 
command above). JIhis Tesult can be explained with the aid of Fig. 4.6(b 
whlich shows the dashed image padded with zeros as ii would be set up inte 
nally in fft2(f，PQ(1)，PQ(2)) prior to compnuting the transform. The im: 
pljied periodicity is as explained earlier. The image now has a uniform blac 
border all around it, so convyolving a smoothing fter with this infinjte $ - 
quence would show a gray blur in the light edges of the images.A similar resu 二 
would be obtained by performing the following spatial filtering， 


下 


Multiply the transform by the filter: 
.G = H.xF， 


06. the top, left rectangle to the original Size: 
= g(1:Size(f，1)，1:Ssize(f，2)); 


>> h = fspecial( 'gaussian'，15，7); 


>> gs = imfilter(f，h)， This fltering Procedure is summarized in Fig. 4.8.The PIePIOcessing Stage 


mlght encompass Procedures Such as determining image size, obtaining the 
padding Parameters, and generating a flter. Postprocessing entails compnuting 
the real part of the result， cropping the image, and converting it to class uint8 
5 Jint16 for storage. 


Frequency domain filtering operations 












Inverse 
Fourier 
transforrmm 


Filter 
function 
所 (or 1) 







Fourier 
transform 





天 (0 2) 已 (re VE 





Post- 
Processing 上 估 


St,y) 
Filtered 
image 






4.3 多 Filtering in the Frequency Domain 


FICURE 4.8 
Baslc steps for 
filtering in the 
frequency 
domaln. 
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dftfilt 


和 裤 - rr 














































The fllter function 如 人 (oo iD Fig. 4.8 muitiplies both the real and imaginary riachine and algorithms used and on issues Such the Sizes of buffters, how wel 
parts of (5 2). 下 玖 (oj) is real,then the phase of the resujt is not changed,: 0 plex data are handljed, and a host of other factors beyond the scope of thls 
fact that can be Seen in the phase equation (Section 4.1) by noting that,ifthe mul Tiscussion. A_ comparison by Brigham |1988] using 1-D functions shows that 
tpliers oft the real and Imagimnary Parts are equal, they cancel out, leaving the 竹 革 terImg usIng an 上 FFTI algorithm can be faster than 3a Spatial Implementatlon 


phase angle unchanged.EFilters that operate in thls manner are caljed zero-PHna 
SijJUiery. IJhese are the only types of lneartflters considered in thlis chapter， 

lt ls well known from linear System theory thbat, under certain mllid condi 
tions, inputting an impulse into a linear System completely characterizes th 
System. When worKking with finite, discrete data as we do in thlis book, the re. 
sponse of a linear System, including the response to an impulse, also is finite.I 
the jinear System ls ]ust a spatial flter, then we can completely deteImine the 
filter Sinlply by observing its response to an Imnpulse.Afilter determined in thi 
manner is called a jite-ipzpuise-respomsye (FIR) jier.Allthe linear Spatial 强 
ters in thls book are 下 IIR filters. 


F 0 large. TIhus,lt ls useful to know how to convert a Spatial flter into an 
9 jivalent frequency domaln filter in order to obtain meaningful comparisons 
ietween the two approaches. 

One obyvious approach for generating a frequency domailin filter, H, that 
esponds to a given Spatial filter, h,ls to jetH=fft2(h， PQ(1)，PQ(2) )， 
re the values of vector PQ depend on the Size of the image we want to 拉 - 
as discussed jn the jast section. However, we are interested in thlis Section 
mtwo major toplcs: (1) how to convert spatial filters into' equivalent fre- 
tency domain filters; and (2) how to compare the resujlts between SPatial 
ain filtering Using function imfilter, and frequency domain hjtering 
gthe techniques discussed in the previous Section. Because, as exXplained 
etail in Section 3.4.1,. imfilter uses correlation and the Orlgin of the 妇 1- 
er js considered at its center, a _ certain amount of data PrepPIocesslng 1S Te- 
ed to make the two approaches equlvalent, The toolbox Provldes 3 
inction, freqz2, that does precisely this and outputs the corresponding 扩 ]- 
:in the frequency domaln， 

Function freqz2 computes the frequency response of FIR 旨 ters, whlich, as 
ntioned at the end of Section 4.3.2, are the only linear filters considered 训 
his book. The result is the desired filter in the frequency domain. The Syntax 
finterest in the present discusslon 1S 





.4.4 An M-function for Filtering in the Frequency Domain 


The sequence of filtering steps described in the previous Section is used 
throughout thls chapter and parts of the next, So lt will be convenient to have 
available an M-function that accepts as inputs an image and a filter functior 
handles all the filtering details, and outputs the filtered, cropped image. The 
following function does this. 


function g = dftfilt(f，H) 

%DFTFILT Performs frequency domain filtering， 
G = DFTFILT(F，H) filters F LR the frequency domain Using the 
filter transfer function H，The output，G，1iS the 于 LIltered 
image，which has the Same Size as FF，DFTFILT automatically pads 
F to be the Same Size as H，Function PADDEDSIZE Can be USed 


和 

各 H = fredqz2(h，R，C) 
5 to determine an appropriate Size for H， 

站 

拖 


here h is a 2-D spatial filter and H is the corresponding 2-D frequency do- 
iain filter. Here, Ris the number of rows, and C the number of columns that 
e wish filter H to have. Generally, we letR = PQ(1) and C = PQ(2),3S ex- 
jained in Section 4.3.1, If freqz2 is written without an output argument, the 
bsolute value of H is displayed on the MATLAB desktop as a 3-D perspec- 
Ye plot. The mechanics involved in using function freqz2 are easily exX- 
Quned by an exampile. 


DFTFILT asSsumes that F is real and that H is a real，uUncentered 
Circularly-Symnmetric filter function， 


% 0btain the FFT of the padded input ， 
F = fft2l(f，Size(lH，1f+)，Size(lH，2) ); 


务 Perform filtering. 
g = real(LIfft2(H.*F) ); 


% Crop to original Size， 
gg=g(1:SLze(f，1)，1:Size(fT，2)) 


Consider the image, f, of Size 600 X 600 pixels shown in Fig. 4.9(a). In 
What follows, we generate the frequency domain filter, H, corresponding to 
:the Sobel spatial filter that enhances vertical edges (see Tabje 3.4). We then 
ompare the result of filtering f in the spatial domain with the Sobel mask 
Using imfilter) against the result obtained by performing the equivalent 
IOcess in the frequency domain. In practice, filtering with a small filter Hike 
obel mask wouild be implemented directiy in the spatial domain, as men- 
ned earlier, However, we Selected this filter for demonstration PuIPOSeS 
ecause its coefficients are Simple and because the results of filtering are in- 
uitive and straightforward to compare. Larger Spatial filters are handled in 
xactly the Same manner. 


Techniques for generating frequency-domain flters are discussed ina the fo 
lowing three sections. 


Obtaining Frequency Domain Filters from Spatial Filters 


In general, filtering in the spatial domain is more efficient computationally 
than frequency domain filtering when the filters are Small. The definltion 中 
sjadll is a compjlex question whose answer depends on Such factors as 节 6: 


4.4 盖 Obtaining Frequency Domain Filters from Spatial Filters “123 


egz2 


上 EXAMPLE 4.2: 
Acomparison of 
fltering ln the 
spatial and 
frequency 
domalns. 
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FIGURE 4.9 

(a) A gray-Scale 
image. (b) Its 


Fourier Spectrum. 
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Pigure 4.9(bj is an Image of the Fourier spectrum of f, obtained as follows 


>> FF = fft2( 了 f); 
>> S = fftshift(1og(1 + abs(F))); 
>> 3 = gscale(S) 


>> ImshowI(as ) 


Next, we generate the spatial filter using function fspecial: 

















h fspecial( ' Sobel ' ) 

h = 
1 0 -1 >:2dmshow(abs(H)，[ ]) 
2 0 -2 >:ftgure，imshow(abs(H1)，[ ]) 
1 0 -1 : 


jlo vewaplot of the corresponding frequency domain filter we type 
:gs = imfilter(double(f)，h); 
>> fredqz2(h) 和 
Which pads the border of the image with 0s by default. The filtered image ob- 
Rigure 4.10(a) shows the result, with the axes Suppressed (techniques for op- taimed by frequency domain processing is given by 
talning perspective plots are discussed in Section 4.5.3),.The filter itself was ob 


tained using the comImands: 


>> PQ = paddedsize(size(f) ) ; 
>> H = freqz2(h，PQ(T)，PQ(2) ) ; 
>> H1 = Ifftshift(H)》 ; 


Figures 4.11(a) and (bj show the result of the commands: 


盖 Imshow(gs，[ ]) 

” figure，imshow(gf，[ ]) 
where, as noted earlier, ifftshift is needed to rearrange the data so that 也 
Origin ls at fhe top, left of the frequency rectangle. Figure 4.10(b) shows a plo 
of abs(H1). Figures 4.10(c) and (d) show the absolute values of H and H1 过 
image form, displayed with the commands 


BTay tonajlity in the images js due to the fact that both gs and gf have neg- 
ave Values, which causes the average Value of the Images to be increased by 
he Scaled imshow command. As discussed in Sections 6.6.1 and 10.1.3, the 


4.4 瘟 Obtaining Frequency Domain Filters frornn Spatial Filters 
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FIGURE 4.10 

(aj Absolute 
value of the 
frequency 
domain 昌 lter 
cOIIeSponding to 
avertical Sobej 
mask. (b) TIhe 
same filter after 
Processlng with 
function 
fftshift.Eigures 
(c) and (dj are the 
filters in (a) and 
(b) shown as 
IInages， 
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并 If 守 lter iDro- 
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FIGURE 4.11 

(a) Result of 

和 ltering 
Fig,.4.9(aj tn the 
Spatial domain 
with a vertical 
Sobel mask. 

(P) Result 
obtained In the 
frequency domaln 
uslng the 和 lter 
shown in 

Fig. 4.10(b). 
Figures (c) and 
(d) are the 
absolute values Of 
(aj and (b)， 
TeSpecttvely. 
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Sobel mask, h, generated above 1l$ used to detect vertical edges in an lmage 
using the absoiute value of the response. Thus, it is more relevant to show the: 
absojute values of the imapges just computed, Figures 4.11(c) and (d) Show 
the Images obtalined using the comrmands 


TIhe approach just explained can be used to implement in the frequency do- 
main the Spatial filtering approach discussed in Sections 3.4.1 and 3.5.1, as wejl 
as any otherFIR Spatial fijlter of arbitrary Size. 泗 


>> figure，imshowfabs(gs)，! ]) 
2 必 昌 Generating Filters Directly in the Frequency Domain 

in this Section, we illustrate how to impiement filter functions directly in the 
frequency domaln. We focus on circularly symmetric fiiters that are Specified 
as varlous functions of distance from the origin of the transform. The M- 
fnctions developed to implement these filters are a foundation that is easily 
extendable to other functions within the Same framework. We begin by impie- 
menting several welL-known smoothing (lowpass) filters. Then, we show how 
to USe Several of MATLAB'Ss wireframe and surface plotting capabilities that 
ad in filter visualization. We conclude the section with a brief discussion of 


saarpening (highpass) filters. 


The edges can be Seen more cjearly by creating a thresholded binarIy 
Image: 


>> figure，imshow(abs(gs) > 0.2*abs(max(gs(:)))) 
>> figure，imshow(abs(gf) > 0.2*abs(max(gf(:)))) 


where the 0.2 mujtiplier was Selected (arbitrarlly) to Show only the edges with 
strength greater than 20% of the maximum values of gs and gf. Figures 4.12(a) 
and (b) show the resuljts. 


4.3 器 Generating Filters Directly in the Frequency Domain 
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FICGURE 4.]2 
Thresholjded 
VerTSlons of 
Figs.4.11(c) and 
(dj,respectively, to 
Show the Principal 
edges more clearly. 
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4.5.1 Creating Meshgrid Arrays for Use in InpPlementing Filters 
in the Frequency Domalin 


Central to the M-functions in the following discussion is the need to compnute 
distance functions 位 om any point to a specified point in the frequency Tectangle 
Because FFT compnutations in MATLAB assume that the Origin of the trans 
form is at the top,left of the frequency rectangle, our distance computations are 
with respect to that point. The data can be rearranged for visualizatlon PUTPoses 
(so that the value at the origin js translated to the center of the frequency rec 
tangle) by using functlon fftshift. 

The following M-function，which we call dftuv,，pPIovides the Decessary 
meshprid array for use in distance compnutations and other Simlilar apPlica. 
tions. (See Section 2.10.4 for an explanation of function meshgrid used in the 
following code.).The meshgrid arrays generated by dftuv are in the order re: 
quired for processing with fft2 OT ifft2, SO nO rearTanging of the data 


required 


centeT of the frequency rectangle, following the basic format explained in 
Fig. 4.2(aj. We can Use function fftshift to obtain the distances with reSpect 
0 the center of the frequency rectangle， 


忆 上 一 口 一 上 上 上 史 
oo 上 ao 


Jhe distance 18 noOw 0 at coordinates (3$, 3), and the array is symmetric abonut 


function [U，V] = dftuv(M，N) this point. 旨 


%DFTUV Computes meshgrid frequency matrizces ， 





% = omputes meshgrid frequency matrices U and 。” 工 ， 

5 和 人 人 9 和 fijlter 5 0 Frequency Domain Filters 

% functions that Can be used with DFTFILT. and V are both ideal omwpassy Filler (ILPF) has the transfer function 

% Mby-N， 

本 人 1 让 Dlzo) 大 Do 

% Set Up range of variab]les ， 0 让 Dazo) > Do 

U=0:(M 一 1T); 

v = 0:(N - 1)， ere Do is a specified nonnegative number and Dfa, yj is the distance from 


Ge 


int ( V) to the center of the filter.The locus of points for which DUO) = 
circle， Keeping in mind that filter 已 multiplies the Fourier transform of an 
Image, we see that an ideal filter“cuts off" (multiplies by 0) all components of 忆 
utslde the circle and leaves unchanged (mujtipiies by 1]) all components on, or 
lde， the circle. Although this filter is not realizable in analog form Using ejec- 
nlc components, it certainly can be Simulated in a compnuter Using the Preced- 
gtransfter function. The properties of ideal filters often are useful in explaining 
henomena such as wraparound error. 

全 Btuitermwortmiowpassy filter(BLPF) of orderm, with a cutoff frequency at a 
ktance Po from the origin, has the transfer function 


工 
1+[LDU2XDPo]7” 


Compute the indices for use in meshgrid , 
idx = findftu > M/72) 
uU(idx) = U(Uidx) 一 站) 
idy = findfvy > N/2) 1 
v(idy) = V(idy) 一 有 N; 


% Compute the meshgrid arraysS， 
[V，U] = meshgrid(v，8) ; 、 


from every point in a rectangle of size 8 X Sto the origin of the rectangle: 





>> [U，V] = dftuv(8，5) 
人 
D = 


巡 ( U) 三 


nlke the ILPF the BLPF transfer function does not have a Sharp disconti- 

uty at Do. For filters with smooth transfer functions,it is customary to define 

Sutof frequency locus at Points for which 妃 (x, y) is down to a specified frac- 

刘 of its Iaximum value. In the Preceding equation, 豆 (z v) = 0.5 (down 
和 from its maximum value of 1) when Dllz2) = 盖 )， 

Jhe Mansfer function of a Carssian /omwpass jirer(GLPF) is given by 


瓦 (u， 1) 王 0 Dj/2u2 


一 上 
下 人 人 斩 国 加 广 一 品 
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where o ls the standard deviation. By letting co = Di, we obtain the followin e can view the fiter as an image [Fig. 4.13(b 咱 by typing 

expresslon In terms of the cutot parameter Z 忆 0: 

> figure，imshow(fftshift(H)，[ ]) 
刀 (u )) 二 2 已 (if DA 
Simjlarly the Spect can be dispiayed Fig. 4.13 by typi 
When Do = Dothetfilteris down to 0.607 of its maximum value of 1. 0 和 (9 7 和 
> figure，jinshow(log(1 + abs(fftshift(F)))，[ ]) 

EXAMPLE 4.4: ”法 Asanilustration,we apply a Gaussian lowpass filter to the 300 X S00-pixet 


Lowpass filtering， image,f,in Fig. 4.13(a).We useavalue of Do equalto 59% of the padded image 


a hb 
c d 


FIGURE 4.13 和 国 
Lowpass filtering. 

(aj) Origlinal 
Image. 
















Rinally Fig. 4.13(d) shows the output image, displayed using the command 


width. With reference to the filtering steps discussed in Section 4.3.2 we have 
>> figure，inmshow(g，! ]) 


>> PQ = paddedsize(SlIze(Tf)); 

>> [U，V] = dftuv(PQ(1)，PQ(2) ) ; 

>> D0 = 0.05*PQ(2) ; 

>> F = fft2(f，PQ(1)，PQ(2) ) ; 

>> exp(-(U.*2 + V.^2)1(2*(D0^*2) ) ) ; 
>> 加 下 芋 了 开征 守 3 





As expected, this image is a bjurred version of the oTiginal， 


The foljlowing function generates the transfer fanctions of all the lowPass 
lters discussed in this Section. 


吧 工 
相 4 


时 PFILTER Computes frequency domain 1owpass filters . 

多 H = LPFILTER(TYPE，M，N，D0，n) creates the transfer function of 
a 10Wpass filiter，H，of the specified TYPE and Size {(M-by-N)，To 
View the filter as an jimage or mesh plot，it should be Centered 
USing H = fftshift(H) ， 


Valid values for TYPE，D0，and n are: 





， 国 0 
e@ S 人 @ deal， Ideal Lowpass filter With Cutoff frequency D0,， mn need 
0 not be Supplied， DO0 must be poSitive， 
image. 
(c) Spectrum of | l 陡 Si ， b 上 W， Buttefrworth Lowpass filter of order n，and cutoff 
(a). (d) Processed 人 D0，The default value for n 1s 1.0， D0 must be 
image， positive， 












gaussian” Gaussian 10wpass filtenr With Cutoff (Standard 
deviation) D0. mn need not be Supplied，D0 must be 
Positive， 


aaa3 320 a8 


和 Use function dftuv to set Up the meshgrid arraySs needed for 
-机 computing the required distances， 
出 ，V] = dftuv(M，N) ; 


.% Compute the distances D(U，V) ， 
四 有 9 全 人 仙 2 0) 


:% Begin filter computations， 
-Switch type 
asSe ideal， 
: H = double(D <= D0); 
“Case 1 btW 1 
ifnargin == 

n=]; 


function [H，D] = Lpfilter(type， M，N，D0，n) PTIilLtenr 
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Z FICURE 4.14 
Geometry for 
function View. 


end 

用 :人 DOD0 7T2E 
case “gauSSian' 

党 2 280D0、2 
otherwise 

error( Unknown filter type.') 
end 0 





Viewpoint 
























Center of 


Function 1pfilter is used again in Section 4.6 as the basis for generating 人 


highpass filters. 





4.33 Wireframe and Surface Plotting 


Flots of functions of one variable were introduced in Section 3.3.1. In the fol- 
ljowing discussion we introduce 3-D wireframe and surface pjots which are: 
useful for visuajlizing the transfer functions of 2-D filters. The easiest Way to: 
drawa wireframe plot of a given 2-D function,H,is to use function mesh， which: 
has the basic syntax 


小 


reaz=-37.5andel=30,which place the viewer in the quadrant defined by 
he -x and 一 》axes, and looking into the quadrant defined by the positive * 
and 》 axes in Fig. 4.14. 

:To determine the current viewing geometry we type 


mesh(H) 





This function draws a wireframe forx=1:Mand yY =1:N, where [M，N] 
Size(H)，Wireframe plots typically are unacceptably dense 二 M and N ar 
large, in which case we plot every kth point using the Syntax > [az，e6JL] = View;i 


mesh(H(1:kK:end，1:Kk:end ) ) b set the Viewpoint to the defaujlt values, we type 


As a rule of thumb, 40 to 60 subdivisions along each axis usually provide > view(3) 
good balance between resolution and appearance. 


MAILAB plots mesh figures in color, by default. The command The viewpoint can be modified interactively by cjlicking on the Rotate 3D 


button in the figure window?s toolbar and then cllicking and dragging in the fig- 
are window. 

”As discussed in Chapter 6, it is possible to specify the viewer location in 
Cartesian coordinates, (x, y, z),， which is ideal when working with RGB data. 
了 rowever, for general Plot-viewing Purposes, the method just discussed in- 
Volves only two parameters and is more intuitive， 


1 
0 
并 






-和 >Coejormap colormap(IO0 0 0] ) 


An 


sets the wireframe to black (we discuss function coliormap in Chapter 6 
MAILAB also superimposes a grid and axes on a mesh plot, IJhese can be 
turned off using the commands 1 








grid off 轴 Consider a Gaussian lowpass filter similar to the one used in Example 4.4: EXAMPLE 4.5: 
aXliS Off Wireframe 
piotting. 


>> H = fftshift(Ipfiliter( 'gaussian'，500，500，50) ) ， 
They can be turned back on by replacing off with on in these two statements 
Finally, the viewing point (location of the Observer) js controlled by function 


Rigure 4.15(a) shows the wireframe plot produced by the commands 
View, which has the syntax 3 
:>> mesh(H(1:10:500，1:10:500)) 


view(az，el) 2>> axis([0 50 0 50 0 1]) 





Where the axis command is as described in Section 3.3.1, except that it con- 


As Fig. 4.14 shows, az and el represent azimuth and elevation angles (in de- 
-tains a third range for the z axis， 


grees), respectively The arrows indicate positive direction.The default values 
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(aiAplot 
obtained Using 
function mesh. 
(b) Axes and grid 
TeImoved. (c) A 
different 
perspective View 
obtained using 
function VIewW. 
(d) Another view 
obtained Using 


the same functlon， 
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Thbis function Produces a plot identical to mesh, with the exception that the 
quadrilaterals in thpe mesh are filjed with colors (this is called jceted snadipnpg) 
To:conyert the colors to gray, we use the command 


coJormap(gray) 


The axis,grid,andviewfunctions worK in the same way as described ear- 
ie for mesh. For example, Fig. 4.16(a) is the result of the following sequence 


NE 
AS 
WA 


RN 
大 


-H = fftshift(lpfilter('gaussian'，500，500，50) )， 
:surf(H(1:410:500，1:10:500) ) 

axis([f0 500 5001]) 

Colormap(gray ) 

grid off;i axliS Off 


ES- 


Ra 


The faceted shading can be smoothed and the mesh lines eliminated by in- 
eTpolation Using the command 






sr 
2 ye 
SEESEsenan 
RCR 
一 一 > 
ah 
















shading LInterp 


2 





yping this command at the Prompt Produced Fig, 4.16(bD). 

When the objective ls to plot an anajytic function of two variables, we Use 
eshgrid to generate the coordinate values and from these we generate the 
lscrete (Sampled) matrix to use in mesh or Surf. For example, to plot 
he function 


Asnoted earjier in thls section, the wireframe is in color by default， trans 
tioning from blue at the base to red at the top. We convert the plot linest 
bjack and eliminate the axes and grid by typing 


>> Colormap([0 0 0]) 
>> axiS Off 
>> Grid off 


je 人 = ze 


om -2to2inincrements of0.1L for both xy and y, we Write 


[Y，X]l = meshgrid(-2:0.1:2，-2:0.1:2) ; 
汉 = X.*exp( 一 X.^2 一 Y.^2) |; 


Figure 4.13(b) shows the result， Figure 4.1$(c) shows the resujlt of 也 
coImmand 


nd hen use mesh(Z) or Surf(Z) as before. Recall from the discusslion In 
ectton 2.10.4 that that columns (Y) are listed first and TOwWS (X) Second in 
Inctlon meshgrid. 


>> view(-25，30) 


whlch moved the observer slightly to the fight, while leaving the elevation cof 
stant. Finalljy Fig. 4.13(d) Shows the result of leaving the azimuth at -25 8 
setting the elevation to 0: 


>> ViLew(~25，0) 
Thas example Shows the significant plotting power of the Simple function mesh. 


Sometimes it is desirable to plot a function as a Surface instead of as a wire 
frame.Function SuUrf does this. Its basic SyntaxX is 





Surf(H) 
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FIGURE 4.16 

(a) Pilot obtained 
using function 
Surf.(b) Resuit 
of using the 
command 
shading LIn+terp， 
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hpfiJlter 
YY 0 


了 上 XAMPLE 4.6: 
Highpass filters. 
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Sharpening Frequency Domain Filters 


Just as ljowpass filtering blurs an image, the Opposite Process, 1i8Hpasy jiitering 

Sharpens the lmage by attenuating the low frequencies and leaving the higte 
frequencies of the Fourier transform relatively unchanged. In this section w 
conslder Several approaches to highpass filtering. 


4 有 Basic Highpass Filtering 


Given the transfer function 局 pb(t 2j) of alowpass filter, we obtain the transfe 
function of the corresponding hlighpass filter by using the simnple relation 


Php( 切 三 工 一 已 p(2 2 


basls for a hlighpass filter generator, as follows: 





function H = hpfiIter(type，M，N，D0，Dn) 

#%HPFILTER Computes fredquency domain highpass filters， 
% HH=HPFILTER(TYPE，M，N，D0，n) creates the transfer function of: 
% ahighpass filter，H，of the specified TYPE and Size (M-by-N) ， 
% Valid values for TYPE，D0，and n are: 
所 





机 了 

站 

IGURE 4.17 Top row: Perspective plots of ideal Butterworth, and Gaussian highpass filters. Bottom row: 
orresponding Images. 


deal， Ideal highpass fijitenr With cutoff frequency D0.Pn 
5%5 need not be Supp1ied.， DO must be posSitive， 


“btwW'”， Butterworth highpass filter of order n，and cutoff 
D0， The default value for n is 1.0. DO0 must be 
P0OSiItivVe， 

Colormap([0 0 0]) 

axis off 

deviation) D0, n need not be Supplied.D0 must be > grid of 

poSitive 让 

The transfer function Hhp of a highpass filter is 1 - H1p， JIhe corresponding image In Fig.4.17(d) was generated using the command 

Where Hlp is the transfer function of the Corresponding 1owpasSs 

filter，Thus，we can Use function LIpfilter to generate highpass 


> figure，imshow(H，[ ]) 
fLters 


务 

咎 

务 

多 

% “gaussian Gaussian highpass filter with cutoff (Standard 
反 

先 

和 

委 
Where the thin black border is superimposed on the image to delineate its 
oundary. Similar commands yielded the rest of Fig. 4.17 (the Butterworth fil- 


n=1; 所 Default value of n， 生 拓 
teris of order 2). 枉 


end 

%% Generate hlLghpass filter 

Hip = Jpfilterftype，M，N，D0，m) ; 
H=1T-HLp; 


Tigure 4.18(a) is the same test pattern，f，shown in Fig， 4.13(a)， EXAMPLE 4.7: 
gure 4.18(b), obtained using the following commands,shows the result of ap- ”Highpass filtering. 
plying a Gaussian highpass filter to 二 : 


对 Figure 4.17 shows plots and images of jideal, Butterworth, and Gaussia 


highpass filters. The plot in Fig.4.17(a) was generated using the commands = paddedsize(Size(f) ) 


0.05*PQ(1) ; 

H = hpfilter('gaussian' ，PQ(1) ，PQ(2)，pD0); 
9 = dftfilt(f，H) 

figure，imshow(g，[ ]]) 


睛 


用 


>> 有 = fftshift(hpfilter( ideal'，500，500，50) ) ; 
>> fesht(H(1:10:500，1:10:500) ) ; 
>> axis([0 50 0 50 0 1]) 
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省 省 


FIGURE 4.18 
(aj OrTriginal 1mage. 
(b) Restilt of 
Gausslian highpass 
filtering. 


了 EXAMPLE 4.8: 
Combining high- 
frequency 
emphasls and 
histogram 
equalization. 





As Fig.4.18(b) shows, edges and other Sharp intensity transitions in the imag 
were enhanced. 再 Oowever, because the average value of an image is given b 
F(0,0), and the highpass filters discussed thus far zero-out the origin of th 
Fourier transform, the image has lost most of the background tonajity presen 
im the original, This Problem is addressed in the following section 


4.0.2 High-Frequency Emphasis Filtering 


As mentioned in Exampjle 4.7, highpass filters zero out the dc term thus re 
ducing the average value of an image to 0.An apProach to compensate for this 
ls to add an offset to a highpass filter, When an offset is combined with mnulti 
plying the filter by a constant greater than 1, the approach is called jgj- 
广 edqizency eppasis ftering because the constant multipjier highjights the 
high frequencies. The multiplier increases the amplitude of the low frequen- 
cies ajso, but the low-frequency effects on enhancement are less than those 
due to high frequencies, as long as the offset is small compared to the muItipP 
eT,. High-frequency emphasis has the transfer function 


hfe(z VD) 一 2 十 Php( 2) 


where ais the offset,p is the multiplier, and Fn(w V) is the transfer function 
of a highpass filter. 


Figure 4.19(a) shows a chest X-ray image, f, X-ray imagers cannot be fo- 
cused in the same manner as optical lenses, so the resulting images generajly 
tend to be slightly blurred. The objective of this example is to sharpen 
Fig. 4.19(a). Because the gray levels in this particuiar image are biased toward 
the dark end of the gray Scale, we 3also take this opportunity to give an exam- 


ple of how spatial domain Processing can be Used to complement frequency 


domain filtering. 
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Figure 4.19(b) Shows the result of filtering Fig. 4.19(a) with a Butterworth 


highpass filter of order 2, and a value of Do equal to 9% of the vertical dimen- 


on of the padded Image. Highpass filtering is not overly sensitive to the value 
F Do as ljong as the radius of the filter is not So Small that frequencies Rear the 
Ilgin of the transform are passed.As expected,the filtered result is rather fea- 

eless, but it shows faintly the Principal edges in the image.The advantage of 


high-emphasis filtering (with a = 0.$ and pb = 2.0inthis case) is shown in the 


image of Fig, 4.19(c),in which the gray-level tonality due to the low-frequency 
omponents was ietained. TIhe following sequence of commands was used to 
generate the Processed images in Fig. 4.19, where f denotes the input image 


[he last comimand geneTated Fig, 4.19(d)j 


_PQ = paddedsize(Size(f) ); 
> b0 = 0,05*PQ(1) ; 
>> HBW = hpfilter('btw'，PQ(1)，PQ(2)，D0，2)， 
H = 0.5 + 2*HBW; 
gbw = dftfilt(f，HBW) ， 
gbw = gscale(gbw) ; 
ghf = dftfilt(f，H)， 
>> ghf = gscale(ghf) ; 
9ghe = histeq(ghf，256) ; 


:Asindicated in Section 3.3， 2, an Image characterlized by gray levels In anar- 
ToW Iange of the gray Scale is an ideal candidate for histogram equajlization.As 


Fig. 4.19(d) shows, this indeed was an appropriate method to further enhance 





0d 


FIGURE 4.19 High- 
frequency 
emphasis filterimng. 
(aj Original image. 
(b) Highpass 
filtering reSult， 

(c) 了 ipgh-frequency 
emphasls resujt， 
(d) Image (c) atter 
histogram 
equalization. 
(Original Image 
cOUItesy of Dr. 
Thomas 及 . Gest， 
Division of 
Anatomalcal 
Sciences， 
University of 
Michigan Medicalj 
School.) 
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the Image in this example. Note the clarity of the bone structure and other de- 
talls that simply are not visible in any of the other three images.The final en 
hanced lmage appears a little noisy, but this is typical of X-ray images whe 
tbelr gray Scale is expanded.The result obtained using a combination of high: 


trequency emphasis and histogram equalization is superior to the result tb 
would be obtained by using either method alone. 


91T11T1TQ1TV 


In addition to the image enhancement applications that we used as iustrations in this 
and the preceding chapter the concepts and techniques developed in these two chap: 
ters Provide the basis for other areas of image processing addressed in subsequent dis- 
cusslons in the book.Intensity transformations are used frequentiy for intensity Scaling; 
and spatial filtering is used extensively for image restoration in the next chapter for 
color processing iD Chapter 6, for image segmentation in Chapter 10, and for extracting 







at 











descriptors from an image in Chapter 11. The Fourier techniques developed in this: 下 


chapter are used extensively in the next chapter for imapge restoration, 这 Chapter 8 fo 


Image compression, and in Chapter 11 for image description. 
























Prez1erz0 
he objective of Testoration is to improve a given image in Some predefined 
ense. Ajthough there are areas of overlap between image enhancement and 
image Testoration, the former is largely a subjective process，while image 
Testoration is for the most part an objective process, Restoration attempts to 
Teconstruct oOT ITecover an image that has been degraded by using a PriOrTi 
iowiedge of the degradation phenomenon. Thus, restoration techbniques are 
Ilented toward modeling the degradation and applying the inverse Process in 
order to recovyer the original image， 

“TIhis approach usually involves formulating a criterion of goodness that 
yields an optimal estimate of the desired result, By contrast ,enhancement 
techniques basically are heuristic procedures designed to manipulate an image 
im order to take advantage of the psSychophysical aspects of the human visual 
System, For example, contrast stretching is considered an enhanceiment tech- 
nique because it is based primarily on the pleasing aspects it might present to 
the viewer, whereas Temoval of image blur by applying a deblurring function is 
considered a restoration technique. 

JIn this chapter we explore how to use MATLAB and IPT capabilities to 
model degradation phenomena and to formnulate restoration solutions. As in 
Chapters 3 and 4, some restoration techniques are best formulated in the Spa- 
tal domain, while others are better suited for the frequency domain, Both 


methods are inyestigated in the sections that follow. 
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FIGURE 5.1 
Amodei of the 
Image degradation/ 
TeSstoTation PTOCeSS. 









































52 Noise Models 


The abihty to simulate the behavior and effects of noise is central to image 
estoration， In this chapter，we are interested in two basic types of noise 
iodels:noise in the spatial domain (described by the noise Probability density 
fnction), and noise in the frequency domain, described by various Fourier 
TOperTties of the nolse. With the exXCception of the material in Section 9.2.3, we 
SSUme in this chapter that noise is independent of image coordinates, 





则 AModel of the Image Degradatiom/Restoration Process 


As Fig. 3.1 Shows, the degradation process is modeled in this chapter as 3 
degradation function that, together with an additive noise term, Operates on 
an input image 帮 x, y) to produce a degraded image 8(X， 7): 


8 y) = 已 [iDI] TOO 


Given 8g(X, y) ,Some knowledge about the degradation function 五 , and som 
knowledge about the additive noilse term ?7{(x, y),the objective of restoration ist 
obtain an estimate， 太 (x, y), of the original image. We want the estimate to be 
close as posslble to the original input image. In general the more we know about 
如 andm, the closeT (xx， y) will be to _ 广 (xy). 

于 区 isalnaearspatialiy 加 variai brocess'it can be Shown that the degrade 
Image 1s given in the Spatal doraajpmn by 


8 y) = Pt y) 太 ty) 十 Tcy) 


wpere AP(x, y) ls the SPatial representation of the degradation function and,a 
In Chapter 4, the Symbol “*”indicates convojlution. We know 位 om the discus 
sion in Section 4.3.1 that convolution in the spatial domain and mujtiplicatiol 
in the frequency qdqomain constitute a Fourier transform pair, so we may wiit 
the preceding Imnodel in an equivalent 方 eqgxyercy doraain representation: 


CUV) = 万 (2V)E(OV) 十 NOT) 


where the terms in capltal letters are the Fourier transformas of the correspondin 
terms in the convolution equation. Ihe degradation function 瓦 (x, ww) sometime 
is called the optfical trarmsjer jnction (DOTIF), a term derived from the Fourie 
analysls of optical systems. In the spatial domain, P(xz, y) is referred to as 也 
Poinf Spread jnrctiomr (PSF),a term that arises from letting Am(x, y) operate on 
point of light to obtain the characteristics of the degradqation for any type 0 
Input. TIhe OTF and PSF are a Eourier transform pair, and the toolbox Provide 
two functions, otf2psf and psf2otf,for converting between therm， 

Because the degradation due to a linear, Space-invariant degradation func 
tion, 五 , can be modeled as convojlution, sometimes the degradation process 1 
referred to aS“convolving the image with a PSF or OTIE”Similarly, the Testora 
tlon Process is SoOmetimes referred to as decoFyvoiutzo7， 

in the toljlowing three Sectlons, we assume that 妃 ls the ldentity operator 
and we deal only with degradation due to noise. Beginning in Section 9.6 we 
look at Several methods for image restoration in the Presence of both 囊 andy 


:2.] Adding Noise with Function imnoise 


e toolbox Uses function imnoise to corrupt an imapge with noise. This func- 
on has the basic Syntax 





g = imnnoise(f，type，parameters) 


here f ls the input image, and type and parameters are as explained later， 
unction imnoise converts the input image to ciass double in the range [0, 1] 
before addling noise to it. This must be taken into account when specifying 
ose parameters. For example,to add Gaussian noise of mean 64 and variance 
0 to an uint8 1mage，we Scale the mean to 64/255 and the variance to 
)J0/(255) for input into imnoise, The syntax forms for this function are: 


g = imnoise(f，'gaussian' ,myvar) adds Gaussian noise of mean mn 
and Variance var to jimage f. The default is zero mean noise with 0.01 
VarTlance， 

g=imnoise(f，'1ocalvar' ,，V) adds zero-mean, Gaussian noise of local 
Varlance, V,to image f, whereVis an array of the same size as f containing 
the desired variance values at each point. 

g= inmnoise(f，'1ocalvar'，image_intensity，Vvar) adds zero-mean， 
Gaussian noise to image f, where the local variance ofthe noise,var,is afunc- 
tion of the image intensity values in f.The image_intensity and var arpgu- 
ments are Vectors of the same Size,and plot(image_intensity,， var) plots 
the functional relationship between noise variance and image intensity The 
image_intensity vector must contain normajized intensity values in the 
Iange [0, 1]. 

9g9=inmnoise(f，'sSalt & pepper'，d) corrupts image f with salt and 
Pepper nolse, where d is the noise density (ie., the Percent of the image 
area containing nolise values). Thus, approximately d*numel(f) pixels are 
affected. IThe default is 0.05 noise density. 
gs=imnoise(f，'speckle'，var) adds multiplicative noise to Image 了， 
“using the equation g = 下 + nxf, where n is uniformly distributed random 
”noise with mean 0 and variance var.The default value of var is 0.04. 
9g=innoise(f，'poisson') generates Poisson noise from the data instead 
of adding 二 nolse to the data. In order to comply with Poisson statis- 
fics, the intensities of uint8 and uint16 Images Inust correspond to the num- 
ber of photons (or any other quanta of information). Double-precision 
-Images are used when the number of Photons Per Pixel 1 larger than 65935 
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-In MATILAB this result is easily generalized to an M x N array, R, of ran- 


(but less than 101). The intensity values vary between 0 and 1 and corre-. 
om numbers by using the exXpression 


spond to the number of photons divided by 10!. 


Several illustrations of imnoise are given in the following Sections. > R=ar+SsSqrt(b*1og(1 ~ rand(M，N)))， 


heie, as discussed in Section 3.2.2, 1og js the natural logarithm, and, as men- 
oned eariier rand generates uniformjy distributed random numbers in the inter- 
val (0, 1 站 下 we letM=N=1,thenthe preceding MATLAB command line yields a 
single value from a random variable with a Rayleigh distribution characterized by 

arameters Zand 二 


3.2.2 Generating Spatial Random Noise with a Specified 
Distribution 


Often, it is necessary to be able to generate noise of types and Parameters be 
yond those available in function imnoise.Spatial noise values are Tandom num 
bers, characterized by a probability density function (PDF) or equlvalently, b 
the _ corresponding cumulative distribution function (CDE). Random numbe 
generation for the types of distributions In which we are interested follow somt 


fairiy simpje rujles from Probabijity theory. 遇 
e11eE1ratOr EUQHOH because it est 1 - 
Numerous random number generTators are based on expressing the genera b ， ， Nd ee [698ed 9 
m numbers. In this particular case，we were able to find a_ closed-form 
tion Probjem in termas of random numbers with a unliform CDFE in the Interva oution As will :,。; : 
ee olution. As w 弄 be shown shortly, this is not always possible and the problem 
(0, 1), In some instances, the base Tandom number generator of cholce 18 5 
then becomes one of fnding an applicable random number generator equation 
generator of Gaussian random numbetrs with zero mean and Unit Variance 。 . 
是 有 ose Outputs w 则 approximate random numbers with the specified CDE 
Although we can generate these two types of nolse USing imnoise'lt 1 mor Table511i 上 
Tabie > sts the random vafiables of interest in the present discussion, along 
meaningful in the present context to use MATILAB function rand for uniform with their PDES C ; 
With their s CDFs and random number generator equations. In sorme cases 
random numbers and randn for normal (Gausslian) random numbers. Thes 全 人生 二 . 
RS th the Rayjeigh and exponential variabjles,it is possible to find a closed-form 
functions are explained later In thbls Sectlon. 生生 全 本 ie . 
本 三 本 汪汪 olutlon for the CDF and its inverse. This allows us to write an expression for the 
The foundation of the apPproach described in this Section is a well-knownm: andom number generator in terms ofuniform rand 、 ， 
result 人 from Probability (Peebles [1993]) which states that 让 18 a uniform 2 lss3 2 ne LIustrated 
aa xampie >.. Jn others, as in the case of the Gaussian and lognormal densities 
istributed random variabjle in the interval (0, 1), then we can obtain a Ian Se ， 
le ae Gin eegbia 0sed-torm solutions for the CDF do not exist, and it becomes necessary to find 
P ) iteInate Ways to generate the desired random numbers. In the Ilognormal case， 
区 Fz1( 20) of instance, We make use of the knowledge that a lognormal random variable， 
| 本 uch tbat ]m ian distributi | 
This simple, yet Powerfujl, result can be stated equivalently as finding a solu: able 5 ] ; (z) has a 和 distribution 8 the exXPression Shown in 
站 able >.] in terms of Gaussian random variables with zero mean and unit vari- 
tion to the equation 到 (z) = 刻 . ie Yi 
ance， et im other cases, it js advantageous to reformnujate the Problem to obtain 
neasler solution. For example, it can be shown that Erlang random numbers 
th parameters 4 and bp can be obtained by adding 六 exponentially distributed 
andom numbers that have parameter a (Leon-Garcia [L994]). 
The random number generators available in imnoise and those shown in 
able 5.1 Play an important role in modeling the behavior of random noise 训 
“Be-pIOcessing apPlhcations, We already saw the usefuiness of the uniform 
tribution for generating random numbers with various CDFs. Gaussian 
记 Se ls used as an apProximation in cases such as imaging sensors opPerating at 
0 lght levejls. Saltrand-pepper noise arises in faulty switching devices. The 
Je of Silver Particles in a photographic emulsion is a random variable de- 
by a lognormal distribution. Rayleigh noise arises in range Imaging， 
Le eXpPonential and Erlang noise are useful in describing noise in laser 
aging. 
M-function imnoise2, listed later in this Section, generates random num- 
TS having the CDFs in Table 5.1.This function makes use of MATLAB func- 
Sn rand, which, for the purposes of this chapter, has the Syntax 


:The exXpresslionz = 4 二 Vpln(l -mW)sometimes is called arandora Po- 


EXAMPLE 5.1: 冉 Assume that we have a generator of uniform random numbers, ww, ip the 兽 : 
Using unlform terval (0, 1), and suppose that we want to use it to generate random DuUmbers, 2 
random numbers 。” with a Rayleigh CDF which has the form 
to generate 

random numbers 2 EC 
with a specified 2 

distribution， 0 





forz 三 4 
forz < 4 


To find z we solve the equation 
1 一 SS 风 一 1? 


z=a+Nbln(l -人 切 ) 


Because the square root term is nonnegative, We are assured that no values of 
less than a are generated. This is as required by the definition of the Raylei 丘 
CDFThus, auniform random number 刀 from our generator can be used in b 
previous equation to generate a random variable xz having a Rayleigh distnbt: 
tion with Parameters a and D. 


OT 


A = rand(M，N) aid 
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十 叱 5 


(The Es are exponential 
random numbers 


is function generates an array of Size M x N whose entries are uniformaly dls- 
utednumbers with values in the interval (0. 1T).IfNis omitted it defaults to 
called without an argument, rand generates a Single random number that 


-Generatori 
玫 1 十 正 ? 十 …: 





A= randn(M，N) 


MATLAB function rand 
with Some additional logic 


MATLAB function 
MATLAB function 


六 and 


之 二 人 


wing Syntax formas: 


forz < 4 
for& < 关 z<< 也 


I=Tfind(A) 
fir，cj = flnd(A) 
[Fr，c，Vvl=TfTind(Al) 





| 二 er(z-92p z0 


nonzero values of A as a column vector, V， 
The first form treats the arrayAin the format A(:),So Ials acolumn vector， 


上 总 
口 忌 
思 
人 
一 卡 
f 
co 





% 
翅 
总 
所 
四 
忆 
所 
所 
局 
= 


aP， 十 六 P 
性 = (aa 一 PP+(b5 -PP 


阁 ? 一 


= find(A >= 64&A<= 192); 
AI) = 128; 


otherwise 
z>10 


z 羡 0 
z 履 0 


he first two forms of function find are used fredquently in the remaining 
hapters of the book. 

Unjike imnoise,the following M-function generates an M x Nnoise array,R 
hat is not scaled in any way.Another major difterence ls that imnoise outputs 
sy image, while imnoise2 produces the noise pattern itse 引 The userT SPeci- 
es the desired values for the noise parameters directiy Note that thbhe noise 
ay Iesujting from salt-and-pepper nolse has three values; 0 corresponding to 
eT holse, 1] corresponding to sajt noise, and 0.3 corresponding to no noise. 


er[in(z)-e]4 252 


Otherwlse 
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27TTzz 
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TABLE 5.1 Generation of random variables. 


Unaiforna 
(aussian 
ELognorni3l 
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imnotise2 
II 


















et default valLues ， 
nargin 三 兰 
a=0;b=1]; 


This array needs to be processed further to make it useful. For example, to co 
rupt an image with this array, we find (using function find) all the coordina 
in R that have value 0 and set the corresponding coordinates in the image to th 
smallest possible gray-level value (usually 0). Similarly we find ai the coordi 
nates in R that have value 1 and set all the corresponding coordinates In thes 
image to the highest possible value (usually 2S5 for an 8-bit image).TIhls Proce 
simulates how salt-and-pepper noise affects an image in Practice. 


function R = imnoise2(type，M，N，a，b) 
%IMNOISE2 Generates an array of random numbers with s$pecified PDF ， 56 "Uniform 

R = IMNOISE2(TYPE，M，N，A，8) generates an array，R，of Size -Rs=ar+t(b~-a)xrand(M，N); 
M-by-N，whose elements are random numbers of the specified TYPE Case 'gauSSian， 

with parameters A and 8B， If only TYPE is included in the 人 
input argument 1List，a single random number of the specified 0 
TYPE and default parameters Shown below is generated. If only 
TYPE，MH，and N are provided，the default parameters shown below- 
are Used， IfM=N=1，IMNOISE2 generates a Single randonm 
number of the specified TYPE and parameters A and B. Check to make Sure that Pa + Pb is not > 1. 
ff(a+b)>1i 

Valid values for TYPE and parameters A and 8 are: error( 'The Sum Pa + Pb must not exceed |, ') 
nd 


deviation B，The default values are A= 0,，B=1| 
'salt & pepper' Salt and pepper numbers of amplitude 0 With 
probabjility Pa = A，and amplitude 1 with 
probability Pb = B，The defaujlt Values are Pa = 
Pb=A=8B=0.05， Note that the noisSe has 


5 a，The coordinates of these points we cal1Ll 0 (peppenr 


有 >88g&<= (ar+b)，These we call 1 (Salt noise)， 
rand (M，N) ; 
= find(X <= 3) ; 


values 0 (with probability Pa = A) and 1 (with cC) = 0; 

probability Pb = B)，sSo scaling is_ necessary 1jf = a + bi 

values other than 0 and 1 are required，The nois CC=findX >a&X<=U); 
matrix R is assigned three ValueSs， If ROX，y) =: R(C) = 1 


0，the noise at (X，y) is pepper (black)， II{ Se 了 0gnormal， 
R(X，y) = tf，the noise at (X，y) is Salt - 
(white). If R(X，y) = 0.5，there is no noise 


assigned to coordinates (X，y) . -end 
'1ognormal， Lognormal numbers with offset A and shape Rs= axexp(bxrandn(M，N) ) ; 
parameter B. The defaults areA= 1andB= Payleigh' 
0.25， 用 a + (~b*1l1og(1 -~ rand(M，N))),^0.5; 
ray1lLeigh， Rayleigh noise With parameters A and B，The ase ,exponential， 
default Values areA=0andB= 计 nargin <= 3 
'exponential' Exponential random numbers with parameter A， Th 3 as= 1 
default is A = 1 nd 
erlang Eriang (gamma) random numbers With parameters 人 A fa<=10 


and B， 8 must be a positive integer. The 
defaults areA=2andB=5， Erlang random 
numbers are approximated as the Sum of 8 
exponential random numbers . 


end 
k = -1/a， 
R = = kx*l1og(1 ~ rand(M，N) )， 


Oo 


-error('Parameter a must be positive for exhponential type， 


5.2 罗 Noise Models 


Uniform: Uniform random numbers in the interval (A，B) (MTIN) = 0.5; 
The default values are (0，1). 5 Generate an M-by-N array of uniformly-distributed random numbers 
gaussSian， Gaussian random numbers With mean A and Standard in the range (0，1)，Then，Pa*(Mx*N) of them will have Values <= 


n01Sej)， Similarly，Pb*(M*N) points will have values in the range 
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EX AMPLE 5.2: 
Histogramts of 
data cenerated 
using the function 
Imnozse2. 
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CaSe “erlang 2500 
if nargin <= 3 光 风 二 全 二 
6 二 关 2000 
end 
if (b ~= round(b) | b <= 0) 
error( Param b must be a positive integer for Erlang，') 1000 
end 
k = -1/a; 
R = zeros(M，N) ; 2 
for j〗 = 13:b 0 
R=R+Kkrloglt 一 rand(M，N))， 0 010203040506070809 1 
end 
otherwise Eee 
errort Unknown distribution type.') 5000 
end 人 
4000 
村 Figure 9.2 shows histograms of all the random number types in Table 3000 
The data for each plot were generated Using function imnoise2. For exaInpl 2000 
tbhe data for Fig. 5.2(a) were generated by the folowing command: 
1000 
>> ”= imnoise2( gaussian'，100000，1，0，1) 0 
和.05 -1 5 -2 25 3 35 和 05 5 
Thils statement generated a column vector，r, with 100000 elements，eak 
beling a random number from a Gaussian distribution with mean 0 and st 2 9000 
dard deviation of 1. The histogram was then obtained using furiction hist 8000 
which has the syntax 7000 
6000 
pP = hist(Fr，bins) 5000 
4000 
whbere bins ls the number of bjns. We used bins = 50 to generate the his 3000 
tograms in Fig. 9.2. The other histogranmts were generated in a simjlar manner 0.5 2000 
in each case, the parameters chosen were the default values iisted in the ex 1000 
Planation of function imnoise2. 0 2 4 6 8 10 1 14 人 2 4 6 8 10 12 


3.<.3 Periodic Noise 


Periodic noise in an image arises typically from electrical and/or electromechani 
cal interference during image acquisition. TIhis is the only type of spatially depen 
dent noise that will be considered in this chapter. As discussed in Section 5.4 
periodic noise is typically handled in an image by ftering in the frequency do 
malmn, Ourimodei of periodic noise is a 2-D sinusoid with equation 


= 4sinl27totx + Br + 27rum(y + 有)/N] 


Which we see is a pair of compliex conjugate impulses located at 
亿 十 zu + and (一 mu 一切),respectively. 

"The following M-function accepts an arbitrary number of impujlse locations 
(frequency coordinates), each with its own amplitude, frequencies, and phase 
dsplacement parameters, and computes r{x, y) as the sum of Sinusoids of the 
form described in the previous paragraph. The function also outputs the Fourier 
transform of the sum of sinusoids， Ra,b),andthe Spectrum of Ru 2). TIhe sine 
Waves are generated from the given impulse location information via the inyerse 
DFT This makes it more intuitive and simplifies visualization of frequency con- 
ent in the Spatial noise pattern. Only one pair of coordinates is required to de- 
fne the location of an impulse. The program generates the conjugate SYymmetIlc 


让 了 


where 4 is the amplitude, wo and 2 determine the sinusoidal frequencies witb 
ITesSpect to the Y- and axis, respectively, and Band 已， are phase displace- 
ments with respect to the origin. The AM X N DFTofthis equation is 


和 
RD) 一 姜 [(e2 ev)80 + tu ++ 2 罗 ) 一 (er -wu 一 2)] 


5.2 额 Noise Models 
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FIGURE 5.2 
Histograms of 
random numbers: 
(a) Gausslan， 

b) unliform， 

(c) lognormal， 
(d) Rayleigh， 

(e) exponential， 
and ( 菩 Erjang. In 
each case the 
detault 
Parameters jisted 
in the explanation 
of function 
imnoise2 werIe 
Used. 


we 
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figures 3.3(aj and (b) show the Spectrum and spatial sine noise pattern EXAMPLE 5.3: 


herated using the following commands: Using function 
imnoise3. 


impulses. (Note in the code the use of function ifftshift to convert the ce 
tered R into the Proper data arrangement for the ifft2 operation, as discussed 
in Section 4.2.) 
C = [0 64; 0 128; 32 32; 64 0; 128 0; -32 32]; 
[fr，R，S3S] > imnoise3(512，512，C) 

imshow(S，[ ]) 

figure，imshow(r，[ ]) 


imnoise3 function [FrF，R，S] = imnoise3(M，N，C，A，B) 

we %IMNOISE3 Generates periodic noise 

[Fr，R，83] = IMNOISE3(M，N，C，A，B)，generates a Spatial 
Sinusoidal noise pattern，r，of Size M-by-N，its Fourier 
transform，R，and Spectrum，S. The remaining parameters are as 
fo1LLOWS : 








]lthat the order of the coordinates js (z,v). These two values are speci- 
with reference to the center of the frequency rectangle (see Section 4.2 for 
definition of the coordinates of this center point). Figures $.3(c) and (d) 


-by-2 matr:i ntainj air frequency domaa 。 
| how the result obtalined by repeating the previous commands, but with 


coordinates (U，Vv) indicating the locations of impulses in the 
frequency domain，These locations are With respect to the 
frequency rectanglie center at (M/2 + 1，N/2 + 1)，0nly one pair 
of coordinates is required for each impuise. The progranm 
automatically generates the locations of the Conjugate Symmetri 
impujises，These jimpulse pairs determine the frequency content 
of 『， 


”= [0 32; 0 64; 16 16; 32 0; 64 0; -16 16]; 
larly, Fig. 5.3(e) was obtained with 


= [6 32; -2 2]; 


A is a 1-by-k vector that contains the amplitude of each of the 
K impulse pairs， If A is not included in the argument，the 
defaujt used is A = ONES(1，K)， B is then automaticalliy Set to 
its default Values (See next paragraph)， The value specified 
for A(]j) is associated With the coordinates in C(j，1:2)， 


B is a K-by-2 matrix containing the Bx and By phase componentS 
for each impulse pair，The default values for B are B(1:K，1:2) 
= 0， 


% Process input parameters ， 
[K，n] = Size(C) |; 
if nargin == 3 


器 oopdodpododoudo upe yo yp 


eduency component. 圳 





、 四 4 Estimating Noise Parameters 
elseif nargin == 4 ne paraImneters of Periodic noise typically are estimated by analyzing the 
B(1:K，1:2) = 0; ulerT Spectrum of the image. Periodic noise tends to produce frequency 


让 es that often can be detected even by visual inspection. Automated analy- 
18 possible in situations in which the noise spikes are Sutficiently Pro- 
Oounced, or when some knowiledge about the frequency of the interference is 
allabje. 
n the case of noise in the spatial domain, the parameters of the PDF may 
eknown partially from sensor Specifications, but it js often necessary to esti- 
tte them from sample images. The relationships between the mean, mm, and 
arlance, o“, of the noise, and the parameters a and requlred to cotmpletely 
pecify the noise PDFs of interest in this chapter are listed in Table 5.1. Thus， 


end 


5 Generate R. 

R = zeros(M，N) ; 

for j = 1:K 

U1 = M2+1+CGj，1); 

R(U1，v1) = 工 * (A(j)y/2) 
sompJlLex Conjugate . 8 
uc = Me+1-CUj，1)iv2=N2+1--C0，2)， 

RU2，v2) = (AUj)/2) * exp(ix*2*pi*C(j，2) * 8B(j，2)7N); 


v1 = N2+1+Cj2); 
”exp(I*2*pi*C(j，1) * BOj，1)7N) 


end problem becomes one of estimating the mean and variance from the samr- 
Compute Spectrum and spatial Sinusoidal pattern， Image(s) and then tising these estimates to solve for ua and 六 . 

S = abs(R); | Tet zi be a discrete random variable that denotes intenslty levels in an 
”= real(ifft2(ifftshift(R) ) ); nage， and let pf(zih),L = 0,1,2,..., 了 一 1bethe corresponding normalized 
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ompute the histogram of the pixels in the ROI. 
imhist(f(B) ) ; 
0btain the number of pixels in the ROI if requested in the output. 


rgout > 1 
npix = SUm(B(:)) 


interest (ROT) in MATLAB we use function roipoly，which generates : 
poljygonal ROI. This function has the basic SYDtaX 


8B = roipoly(f，c，P) 


where f is the image of interest and c and r are Vectors of corresponding (S 
quential column and row coordinates of the vertices of the polygon (note thatge 
columns are specified first).The output, 8B, Is a binary image the same size 38 趟 
with 0's outside the reglion of interest and ls insjide Jmage Bis used as a mast 
to limit operations to within the Teglon of interest. 

Jo specify a polygonal ROI Interactively, we use the syntax 


ipgure 9.4(aj) shows anoisy image,denoted byf in the following discussion. 下 XAMDPLE 5.4: 
ssThe objective of this example js to estimate the noise tyPe and its Parameters 
ising the techniques and tools developed thus far. Figure 5.4(b) Shows the 

asK,B, generated interactively using the command: 


B = roipoLy(f) 
B，c，r] = rolpoly(T) 


ipgure 5.4(c) was generated using the commands 


[p，npix] = histroi(f，c，r); 
double-cjlick adqs a final vertex to the selection and starts the Hi of the polyg Ts >> figure，bar(p，1) 


[na] 





a VertexX， 
To obtain the binary Image and a list of the Pojlygon vertices, we use 也 
CoOlistruct 


长 


FIGURE 5.4 

(a) Noisy image. 
(b) ROI 
generated 
inteTactively. 

(c) Histogram of 
ROL. 

(d) Histogram of 
Gaussian data 
generated Using 
function 
Imnoise2. 
(Original image 
coOUItesy of Lixi， 
Inc.) 


[8，Cc，r]j = roipoly(。， ， 辆 


onal Tegion whose vertices are specified by vectors c and r, as in the preceding 
discussion. Note the use within the PIOgIamnl of function roipoly to duplicate 
the polygonal region defined by c and r. 








140 
function [p，npix] = histroi(f，c，F) 120 
HTSTROI Computes the histogram of an ROIT in an image， 100 
名 [P，NPIX] = HISTROT(F， C，R) computes the histogranm，P，of a 
5 polygonal region of interest (ROI) in image F，The polygonal 30 
泊 region is defined by the coIlumn and row coordinates of ItS 60 
5 Vertices，which are specified (Sequentially) in vectors C and R， 汪 
respectively， Al1 pixels of F must be >= 0 Parameter NPIX is the 
5 number of pixels in the polygonal region， 20 
5 Generate the binary mask image， 0 0 200 300 


roipoly(f，c，F)i 
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3.3 避 Restoration in the Presence of Noise Only 一 Spatial Filtering 































The mean and variance of the region masked by B were obtained as follows: Spatial Noise Filters 


able $.2 lists the Spatial filters of interest in this section, where vv denotes an 


>> [YV，uUnvj = Statmoments(h，2) ; 
x Subimage (Tegion) of the input noisy image,g.The subscripts on 9 indi- 





>> V 
te 和 hat the Sublmage is centered at coordinates (X, y), and F(x, y) (an esti- 
ate of 轧 denotes the filter response at those coordinates. The linear filters 

Re Re are implemented using function imfilter discussed in Section 3.4. The 

>> Unv edian， max, and min flters are noniinear, order-statistic 三]ters. The median 


ter can be implemented directly using IPT function medfilt2.The max and 
in filters are Implemented using the more general order-filter function 
dfilt2 discussed in Section 3.9.2. 

The following function, which we call spfilt,performas filtering in the spa- 
ldomain with any of the filters listed in Table 5.2. Note the use of function 
incomb (mentioned in Table 2.5) to compnute the linear combination of the 
nputs. The Syntax for thls function iS 


147.7430 410.9313 


It ls evident from Fig. 9.4(c) that the noise is approximately Gaussian 
general,it is not possible to know the exact mean and variance of the noise b 
cause lt is added to the gray levels of the image in region B. However, by 
lecting an area of nearly constant background level (as we did here), an 
because the nolilse appears Gausslan,， we can estimate that the average gra 
level of the area B is reasonably close to the average gray level of the ; Inmaag 
without nolse, indicating that the noise has zero mean. Also, the fact that 刀 
area has a nearly constant gray level tells us that the variability in the regio 
deftined by Bis due primarily to the variance of the noise. (When feasible an 
other way to estimate the mean and variance of the noise is by imaging a tai 
get of constant, known gray level.) Figure 35.4(d) shows the histogram of as 
of npix (thlis numberis returned by histroi) Gaussian random variables wi 
mean 147 and variance 400, obtained with the following commands: 





B = imnlincomb(c1，Al1，c2，A2，.， . .,，ck，AK) 
ich implements the equation 
B = C1*Af + C2*A2 + ， :+ CKxAK 


ere the c's are real, double scalars, and the As are numeric arrays of the 
ne class and size. Note also in subfunction gmean how function Warning can 
eturmed on and o 圭 . In this case, we are SuUppressing a warning that would be 
sued by MAILAB iithe argument of the 1og function becomes 0. In general， 
arning can be used in any PTogram.The basic syntaxX is 


>> X = imnoise2('gaussian'，npix，1，147，20)， 
>> figure，hist(X，130) 
>> axlis([0 300 0 140j]) 


where the number of bins in hist was selected so that the result woujldb 
compatiblje with the Plot in Fig. 5.4(c). The histogram in this figure was ob 
tained within function histroi using imhist (see the preceding code), whi 
employs a diftferent scaling than hist. We chose a set of npix random vari 
ables to generate X, So that the number of samples was the same in both his 
tograms. The Similarity between Figs. 3.4(c) and (d) clearly indicates that th 
nolse is indeed well-approximated by a Gaussian distribution with parameters 
that are close to the estimates v(1) and v(2)， 





warningf( message ') 本 


ns function behaves exactly like function disp, except that it can be turned 
and of with the commands warning on and warning off. 


nction f = Spfilt(g，type，m，n，parameter) spfilt 
SPFILT Performs 1inear and nonlinear Spatial filtering， 
F = SPFILT(G，TYPE，M，N，PARAMETER) performs spatial filtering 
of image G using a TYPE filter 0f Size M-by-N， Valid calls to 


了 estotation in the Presence 
SPFILT are as ff0110WS， 


of Noise Only 一 Spatial Filtering 





When the only degradation present is noise, then it follows from the model 这 F = SPFILT(G，'amean' ， 旭 ，NN) Arithmetic mean fiitering. 
Section S.1 that F = SPFILT(G，'gmean'  ，M，NN) Geometric mean filtering， 
F = SPFILT(G，'hmean' ，M， Harmonic mean filtering， 
SUxz, y) = xy) +T(Oxz,y) F = SPFILT(G，'chmean' ，W，N，Q) Contraharmonic mean 


filtering of order Q，The 


The method of choice for reduction of noise in this case is spatial filterinpg， LSI 多 
default is Q = 1.5. 


techniques similar to those discussed in Sections 3.4 and 3.5. In this section we sunt 


F = SPFILT(G，'median'，MNN Median filtering， 

marlze and Implement several spatial filters for noise reduction.Additional deta 涝 已 专 ie 由 N) Max filtering 
。 。 时 
on the characteristics of these filters are discussed by Gonzalez and Woods [2002 : F = SPFILT(G，'min'， 人 L，N) Min filtering. 
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function 咎 = gmean(g， mn a 让 
% ImplementSs 3a geometric mean filter. C 中 
inclass = Class(g); 四 8 区 
g = im2doubletgj) FIGURE 5.5 
%5 DilSab1e 10g1(0) Warningd , (3) Imasge 


coOITIUPted by 
pepper noise with 
PIobabllity 0.】. 
(b) [Image 
corrupted by salt 
nolse with the 
Same probablility， 
(c) Result of 


warnling off ; 

f = exp(infilter(10g(g)，onesfm n)， Preplicate )) (人 17 An) 
Warning on; 

f = Changeclassflnclass，f) 


function 咎 = haprmean(g， mn) 
% Implements a harmonic mean fijiter， 


inclass = class(g); 人 filtering (a) witha 

g = im2doubjefg); 区 RS 3X3 

f=mxn,.Ainfilter(l./(gr+ eps)，ones(m， nj)， Preplicate ) ; ee 

f = changeclasstfinclasSs，Tf) |; liter of OrGer 
ee 2 O = 1.5.(d) 

RON 全 Result of filtering 


(b) with 

O = 一 1.9， 

(e) Result of 
filtering (a) with a 
3 X3maxfilter， 
( 扫 Result of 
fltering (b) with 3 
3 X3mmnflter. 


function 下 = charmean(g，m， n，q) 

委 Implements a Contraharmonic mean fl1Tter， 

ncClass = ClassS(g); 

im2doubjle(g) ; 

imfilter(g, (qt1)，ones(m，n)， "replicate ) ; 

f ./ (inmfilter(g.^q，ones(m，n)，'"repLicate') + epSs); 
changeclass(inclass， 了 于 ) ; 


有 


1 


工 
9 
二 
本 
于 


function f = aJphatrim(g， ff nd) 
% ImpJements an alpha-trimmed mean filltef， 
inclass = Classf(g); 





9 = im2double(g); 
f= infilter(g，ones(m，n)， Synmmetric ) ; 
for kK = 1:d/2 
f = lmsubtract(f，ordfilt2(g，Kk，ones(tm，n)， Symnmetric ) ) ; 
end ， 


fork= (nn 一 (dg/2) + 1T)3mxn 
f = imsubtract(f，ordfllt2(g，K，0nes(m，n)， SYmmetriIc ) ) ; 


end 
下 
f = changeclass{inclass， 了 ) ; _ -er 





EXAMPLE 5.5: 。 濡 The image in Fig, 5.5(a) is an uint8 image corrupted by pepper noise 0 
Using function with probability 0.1.This image was generated using the folljowing command 
spfilt， [f denotes the original image, which is Fig. 3.18(a)]: 





>> [M,，N]j = Sizel(f) 

>> R = lmnolise2( Salt & pepper ，M，N，0.1，0) 
>> C = find(R == 0) | 

2 

>> gp(c) = 0 





Salt & pepper'，M，N，0，0.1); 
和 


Statements 
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A good approach for filtering Pepper noise is to use a contraharmonic ilt 


>> fpP = Spfilt(gp， "chmean' ，3，3，1.5) |; 


similarly, salt noise can be filtered using a contraharmonic filter with &a neg 
tve value of C: 


>> fS = Spfilt(gs， 'chmean'，3，3，-1.5) | 


Figure 9.3(d) Shows the result. Similar results can be obtained using max 1 
mln filters. For example, the images in Figs. .5(e) and ( 们 were generated fraol 


Flgs,.3.5(a) and (bj,Tespectively, with the following comrmands: 
GURE 5.6 (aj IJmage corrupted by Salt-and- -Pepper nolse with density 0.25. (b) Result obtained using a 
dan flter of SIZe 了 X 光 《c) Result obtained sing adaptive median 加 tering with Oomax = /， 


>> fpmax 
>> 了 于 Smzn 


Spfilt(gp， max' ，3，3) 
SpO9S5 人 有 7 沁 ) 


Other solutions using spfilt are impiemented in a similar manner. 


2.3.2 Adaptive Spatial Filters 


The flters discussed in the previous section are applied to an inmaage witho 
Tegard for how Image characteristics vary from one location to another: 
Some appjllcations, results can be Improved by using filters capable of adaptin 芭 
their behavior depending on the characteristics of the image in the area be 也 名 
filtered. As an jlustration of how to implement adaptive spatial filteTrs 
MATLAB, we consider in this section an adaptive median filter.As before， 
denotes a Subimage centered at location (x, y) in the image being Processei 


The algorithm, which is explained in detail im Eee and Woods [2002],: 2 
follows: Let 1gure 3.0(a) Shows the circuit board image，f，corrupted by salt-and- EXAMPLE 5.6: 


二 Depper noise, generated using the command 交 median 
Literng， 


f = adpmedian(g，Smax ) adpmedian 


zmnin = IIinImUm intensity value jn 9 ，， 


<max 二 maximum Intensity value in 7 imnojise ( 后 Sa1t 义 Pepper 25 ) 
zmed = median of the intenslty values in Sr 1 
zy = intensity value at coordinates (X, )) ndFig.5.6(b) shows the result obtained using the comimand (see Section 3.5.2 


The adaptive median filtering algorithm works in two levels, denoted leve| 


and jevel 妃 : 
= medfilt2(g，[7 7]， 'Symmetric” ) 
Level 4: If znin < xzned < zmaxy go to level 局 
Else increase the window S1Ze S lmnage 1s reasonably free of noise, but it is quite blurred and distorted 
于 window Size 三 max Tepeat level 人 4 8 See the connector fingers in the top middle of the image). On the other 
Else output zmed and， the command 
Level 已 : 寺 es <ry 3 output <xy 


节 ljse Output zmed 


where max denotes the maxlimum alowed Size of the adaptive filter windoy 
_ Another option im the last step in Level 4 is to output zry instead of the media 
This produces a slightly less blurred result but can fail to detect Salt (Peppes 


ad the image in Fig. $.6(c), which is also reasonably free of noise, but is 
siderably less blurred and distorted than Fig. S$.6(b). 钳 
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4 Periodic Noise Reduction 
py Frequency Domain Filtering 





了 册 ， 
1 
et 


































As noted in Section 5.2.3, periodic noise manjfests itself as Impulse-like burs 
that often are visible in the Fourier spectrum. Thbe Principal approach for 
terlng these components is via notch filtering. The transfer function of a B 

terworth notch filter of order Pis given by 


PSF = fSspecial(' motion' ，1en，thetal)) 


call to fspecial returns a PsF that approximates the effects of linear 
tion of a camera by 1en Pixels. Parameter theta is in degrees, measured 
由 让 respect to the Positive horizontal axis in a counter-clockwise direction. 

e default value of Ilen is 9 and the defaujlt theta is 0, which corresponds to 
on of 9 Pixels in the horizontal direction. 


Di 
Di VD)DD(o 2) 





五 (2 7) 三 


下 


where 


172 
DO) =[G-M2-i +-N2-oo27 imfilter(f，PSF，'circular') ， 


and 
erIe 'circular' (Table 3.2) is used to reduce border effects. We then com- 


DO =[( 人 -MA2+ti +(-N/2+ 20)] Jete the degraded image model by adding noise, as appropriate: 


where (to, zb) (and by symmetry) (一 上 0， 一 20j) are the locations of the“notches: 
and Do is a measure of their radius. Note that the filter is Specified with respec 
to the center of the frequency rectangle, so it must be Preprocessed with funec: 
tion fftshift prior to its use,as explained in Sections 42 and4.3. 

Writing an M-function for notch bltering follows the same principles used 
in Section 4.5. It is good practice to write the function so that multiple notches 
can be Input, as in the approach used in Section 9.2.3 to generate multiple S， 
nusoidal noise Patterns. Once 玖 has been obtained, filtering is done using 全 
function dftfilt explained in Section 4.3.3. 


= g + noise 


re noise is a random noise Image of the same size as g, generated using 
of the methods discussed in Section S.2. 

When comparing in a given situation the suitability of the various ap- 
roaches discussed in this and the following sections, it is useful to use the 
me image or test pattern So that comparisons are meaninegful. The test Pat- 
generated by function checkerboard is particularly useful for this pur- 
because its size can be scaled without affecting its principal features.The 


3X IS 





妆 计 届 Modeling the Degradation Function 
C = checkerboard(NP，M，N) 


Where NP is the number of pixels on the side of each Square,Mis the number of 
Iows, and N is the number of columans. If N is omitted, it defaults to M. If both M 
amdN are omitted, a square checkerboard with 8 squares on the side is gener- 
ating PSFs and testing the results with various restoration algorithms. Anothers ated， 于 in addition, NP is omitted, it defaults to 10 Plxels. IJhe light Squares on 

电 he left half of the checkerboard are white. The light squares on the right half 
he checkerboard are gray. To generate a checkerboard in which all Hight 


slde the mainstream of our discussion here: for an introduction to this topi [ 
duares are white we use the command 


See Gonzalez and Woods [2002]. Finaly， when no information is availab| 
about the PSF we can resort to“blind deconvolution” for inferring the P9 
Thls approach is discussed in Section 5.10. The focus of the remainder of 世 
present Section is on various techniques for modeiing PSFs by using function 
imfilter and fspecial,introduced in Sections 34and 3.5, respectively, an 
the various noise-generating functions discussed earlier in this chapter， 

One of the Principal degradations encountered in image restoration prob 
lems is image blur. Blur that occurs with the scene and sensor at rest with re 
SPpect to each other can be modeled by spatial or frequency domain lowpass : 


>K = im2double(checkerboard(NP，M，N)) > 0.5; 


elimages generated by function checkerboard are of class double with val- 
es in the range [0, 1]. 

Because some restoration algorithms are slow for large Images, a good ap- 
ioach is to experiment with small Images to reduce computation time and 
-thus improve Interactivity. In this case, it is useful for display purposes to be 
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checkerboard 


LT8 PRe > OperttOr 
Prodtitcey 4 1L0giCal 
Feilt im2doub1le 必 
1Secd IO DAOUEECe ti 
GE OF 太 cr1ds3 
double, mAicn z 
COHStSteMT IE Ine 
ODN /OPIGr OF 
FeOOICNO 
checkerboard， 
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pixeldup 


0 开 区 人 veo ren ormovaveRw one 一 


EXAMPLEE S.7: 
Modeling a 
blurred, nolsy 
image. 


: 
-Ci 

FIGURE 5.7 

(a) OrIiginal 
image. (b) Image 
blurred usling 
fspecial with 
Len=7,and 
theta = 一 45 
degrees. 

(c) Noise image. 
(d) Sum of (b) 
and (C). 



















able to zoom an image by pixel replication. The following function does this 
(See Appendix C for the code): 


8 = pixeldup(A，m，Dn) 


0 0 O 0 0 0.0145 0 
This function dupiicates every pixel in Aatotal of m times in the vertical direc- 本 es es 
tion and n times in the horizontal direction. Inis omltted, lt defaujts to m. 0 0 0.0376 0 1283 0 0376 0 0 
Re 0 0.0376 0.1283 0.0376 0 0 0 
宙 Figure 9.7(a) Shows a checkerboard image generated by the command 0.0145 ”0.1283 “0.0376 0 0 0 0 
0 0.0145 0 0 0 6 0 


>> ff = Checkerboard(8) ; 


The degraded image in Fig.5.7(b) was generated using the commands 
.noise = imnoise(zeros(Size(f))，'gaussian'，0，0.001) ; 
>> PSF = fspecial( ' motion ，7，451) 
>> gb = Imfilter(f，PSF， circuLar ) Normally we would have added nolse to gb directly using imnoise(gb， 
aussian'，0，0.001).However, the noise image is needed later in this 
apter, SO we computed lt Separately here. 


The blurred noisy image in Fig.3.7(d) was generated as 
“gg= gb + noise; 


wsJhe noise js not easily visible in this image because its maximum value 1S on 


he degraded image]. This approach js appropriately called Imverse jlterin8， 
om the modejl discussed in Section 9.1, we can eXpIess OUI estimate 38 


5.6 塑 Direct Inverse Filtering 
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Jhis deceptively Simple expression tells us that, even 让 we knew 百 (o 分 富 
actjy we could not recover E (uv) fand hence the original, undegraded imal 
让 j because the noise component is a random function whose _ Four 
transform,， Nu), is not known. In addition, there usually is a problem 
practice with function 妃 ( 册 2) having numerous zeros.Even 让 the 
Nu 2) were negligible, dividing it by vanishing values of 豆 (，) would do 
inate restoration estimates. 
、Jhe typPlcal approach when attempting inverse 和 tering is to form the ra 
=CV)/ (uv) and then limit the frequency range for Outaining 
the inyerse, to frequencies“near” the origin.The jdea is that zeros in 已 (已 
are jess jikely to occur near the origin because the magnitude of the transfor 
typlcaily ls at its highest value in that region.There are numerous variations 
thls basic theme, in which Special treatment is given at values of (5D) fa 
which 豆 is zero or near Zero. This type of approach sometimes is call 
PSetdoinyerye fitering. In general, approaches based on inverse fijtering 
tbls type are seldom Practical, as Example 5.8 in the next section shows。， 





#re Mand N denote the vertical and horizontal sizes of the image and noise 
iays, respectively These quantities are scalar constants,and their ratio， 


Wiich is also a scalar, is Used sometimes to generate a constant array in place 
:he function sn Vsr 2 In this case, even 让 the actual ratio is not 
own, it becomes a simpie matter to experiment interactlively varying the 
ant and viewing the restored results. This, of course, is a crude appTroxi- 
iion that assSumes that the functions are constant， Replacing 
(5 2)/5Srt 2 by a constant array in the Preceding filter equation results in 
eso-called paranmzerric iener filter Asiliustrated in Example 5.8,the Simple 
of Using a constant array can yield Significant Improvements over direct in- 
Se fiterlInpg. 

Wiener flterimng 1 implemented in IPT using function deconvwnr,，which 





2 
六 和 i | 
人 导 


Wiener filtering (after N. Wiener, who first proposed the method in 1942) 攻 
one of the earliest and best known approaches to linear image restoration: Rimiage and fr is the restored image.The first syntax form， 
Wiener filter seeks an estimate 广 that minimizes the statistical error functi 人 


2 


where 尼 is the expected value operator and Fis the undegraded image,.Thes ssimes that the nolse-to-Signal ratio js Zero. Thus, this form of the Wiener fil- 
Jution to this expression in the frequency domain is 全 erl the lnverse filter mentioned in Section 5.6.The syntax 


1 | 天 (es oo) 
下) 区 (2 十 So)ASr( 


Wiener Filtering 


F(L ?7) 王 CD 7 


Where aITay; the function accepts either one. This is the syntax used to imple- 
the parametric Wiener filter in which case NSPR would be an interactive 


已 (ou) = the degradation function et 
alar input. Finajliy, the syntax 


| 玫 (2) = 五 *( 二 如 (TD) 

pi) = the complex conjugate of 厅 (ro fr = deconvwnr(g，PSF，NACORR，FACORR) 

50) = NG2 首 = the powerspectrum ofthe noise 8 
Sumes that autocorrelation functions, NACORR and FACORR, of the noise and 

undegraded image are known. Note that this form of deconvwnr uses the au- 

focorrelation of 7 and 广 instead of the power Spectrum of these functions. 


Tom the correlation theorem we know that 


(coF= SLC。 人] 


Where“。”denotes the correlation operation and \ denotes the Fourier 
fransform. This exXpression indicates that we can obtain the autocorrelation 
function ， JU yy)。/xy)，for use in deconvwnr by computing the inverse 
ourier transform of the power spectrum. Similar comments hold for the auto- 
correlation of the noise， 


Sr = |F(u zi = thepowerspectrum ofthe undegraded image 


The ratio Sn( DASr(V) is called the noige-lo-signal pomer ratio.Weseetha 
[the noise Power Spectrum is zero for all relevant values of wu and 2 this rati 
becomes zero and the Wiener filter reduces to the inverse filter discussed j 
the previous section. 

Two related quantities of interest are the average noise power and the aver 
age Image Power, defined as 


5.7 演 Wiener Filtering 二 /1 


deconvwnr 
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if the restored lmage exhibits ringing introduced by the discrete Fourier 
transform used in the algorithm,it Sometimes helps to use function edgetape 
prIIOr to caling deconvwnr. The syntax js 


where g is the corrupted image and PSF is the point spread function computed 
记 卫 xample 5./.As noted earlier in this Section, fr1 is the result of direct in- 
Tse filtering and, as expected,the result is dominated by the effects of noise. 
siD xampjle 5.7, al displayed images were Piocessed with pixeldup to 
om their size to $12 X 512 pixels.) 

The ratio, 尺 ,discussed earlier in this section, was obtained using the original 
dnoise images from 上 Xxample 3.7: 


edgetaper J = edgetaper(I，PSF) 


This function blurs the edges of the input image, I,using the point spread func 
tion, PSF. he output Image,y,is the welighted sum of I and its blurred version 












The welghting array ， determined by the autocorrelation function C、PSF = abs(fft2(noise)).^2; 和 noiSe power Spectrum 
makes J equaj to I in its central Tegion, and equal to the blurred version of . nA = Sum(Sn(:))/prod(size(noise)); 点 noise average powWer 
near the edges. Sf = abs(fft2(f)).^2; 右 image power Spectrum 
fA = Sum(Sf(:))/Vprod(Size(f) ) ; % image average power 
EXAMPLE 5.8: 桥 Figure 5.8(aj is the same as Fig. 5$.7(d), and Fig. 5.8(b) was obtained usin -R = nA/fA; 
Using function the command 
CCOTVAWNT 提 orestore tbe image using this ratio we write 
restore a blurred， SR F，， 
noisy image、 r1 = deconvwnr(g，PSF) ; : 
fr2 = deconvwnPr(g，PSF，R) ; 
ab 和 Sn 1 Fig.5.8(c) shows,this approach gives a Significant improvement over direct 
Cd 站 ae 3 erTse filterlng. 
FiIGURE 5.8 共 测 二 本 Finally we use the autocorrelation functions in the restoration (note the use 
(a) Blurred, noisy 。 、 汪 8 本 本 fftshift for centering): 
Image. (b) Result 和 ER 人 
of inverse 本 ee 0 CORR = fftshift(real(ifft2(Sn) ) ); 
filtering. :后 


fftshift(real(Ifft2(Sf) ) ) | 


) Resulit of 
(c) Result o deconvwnr(g，PSF，NCORR ，ICORR ) ; 


Wiener fjltering 
using a constant es 
ratio《d) Result 电 -0 Rig. 5.8(d) shows, the result is close to the original, although some noise is 
of wiener filterng 状 0 il evident. Because the original image and noise fanctions were known, we 


ee able to estimate the correct parameters, and Fig. 9.8(d) js the best that 
e accompjlished with Wiener deconvolution in this case. The challenge in 


tunctions， 


ice of functions used in experimenting，until an acceptable result is 
tained. 潮 


other well-established approach to linear restoration is comstraizaed /least 
1es 户 jiering, called regtiarized jiitering in IPT documentation. The defini- 

nM~1 N-1 

之 jp 二 | 


Fa= 口 ; 


工 
Pt y)# 太 y) 人 MAN 


ng this equation, we can express the linear degradation model discussed in 
ton 3.1,g(z, y) = Pt ysF(zy) +m(r yin vector-matrix form, as 


g 三 了 + 了 7 


3.8 于 Constrained Least Squares (Regularized) Filtering ”173 
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For example,suppose that g(x, y) is of size M X N.Then we can form the first 
N elements of the vector g by using the image elements In the frst TOW (0 
g(x, y),the next Nelements from the second row, and so on. Ihe resujlting vec: 
tor will have dimensions MN X 1.These also are the dimenslons of f and 7, afS 
these vectors are formed in the same manner. The matrix HL then has dime 
sions AN X MN.Its elements are given by the elements of the precedlng con: 
vojiution equatlon. 

It would be reasonable to arrive at the conciusion that the restoration PTo 
lem can now be reduced to simple matrix manipuiations. Unfortunatejly this is 
not the case.For instance,Suppose that we are working with Images of medium 
Size; Say M = N = 512. Then the vectors in the Preceding IatrxX equatio 靖 
would be of dimension 262,144 X 1, and matrix 了 would be of dimensions 
262,144 xx 262,144. Manipulating vectors and matrices of these Sizes is not 
trjivial task.The problem is complicated further by the fact that the inverse of 
H does not always exist due to zeros in the transfer function (See section 5.6 
However, formulating the restoration Problem in Imatrix form does facllitate 
derivation of restoration technlques. 

Although we do not derive the method of constrained least Squares that W 
are abonut to Present, central to this method is the jssue of the Sensltivity of t 
inverse of H mentioned in the previous paragraph. One way to deal with hi 
issue is to base optimality of restoration on a measuire of smoothness,Such as the 
second derivative of an image (e.g,, the Laplacian). TIo be meaningtu，the 
restoration must be constrained by the parameters of the Problem at hand. Ihu 
what is desired is to find the minimum of a criterion function,C,defined as 





h4 一 1 AN 一 1 








2 
| 
X=() yY 一 ( 
Subject to the constralnt 
lg 一 一 | 


where | 史 | 2 w7w is the Euclidean vector norm,t fis the estimate of theua 
degraded image, and the Laplaclan Operator V- is as defined in Section 3.5. 

The frequency domain solution to this optimization problem is given by 也 
expression 


再 (2 全) 


7 


F(i 岂 ) = 


where y is a parameter that must be adjusted so that the constraint is satisfie 量 


(fy is zero wehave an inverse filter solution),and P(uV) ES the Fourler tfran5- 
forn of the function 


昌 
;For a column vector w with icomponents,wiw = > 40 where tok,is the kth component of w. 
| 


.8 淖 Constrained Least Squares (Regujlarized) Filtering 





px 基 二 [1 4 1 
0 10 
， Tecognlze thls function as the Laplacian operator introduced in Section 3.5.1. 


Te only unknowns in the preceding formulation are y and | 全 However it can 
hown that y can be found iteratively 让 jwhich is proportional to the noise 
Wer (a scalar),is known. 
onstrained least Squares filtering is implemented in IPT by function 
deconvreg, whlich has the syntax 


fr = deconvpreg(g，PSF，NOISEPOWER ，RANGE ) 


ere g 1$ the corrupted image, fr is the restored image, NOISEPOWER is Pro- 
portional to | and RANGE is the range of values where the algorithm is lim- 


Dent deconvreg Produces an inyerse filter solution.A good starting estimate 
for NOISEPOWER 1S MMNic7 下 7 ]， where M and N are the dimensions of the 
age and the Parameters inside the brackets are the noise variance and noise 


redimean. Tbls estimate js Simply a starting point and, as the next example 


We now restore the image in Fig. 5.7(d) using deconvreg. The image is of 
“Ze 04 X 04 and we know from Example 5.7 that the noise has a variance of 
01 and zero mean.， So，our initial estimate of NOISEPOWER is 
史 ) [0.001 一 0] s 4.Figure 3$.9(a) shows the result of using the command 
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aa deconvreg 


下 入 和 AMPLE 5.9: 
Using function 
deconvreg to 
restore a blurred， 
Dolsy Image. 


a b 


FIGCURE 5.9 

(aj) TIhe image in 
Fig.3.7(d) 
restored using 3 
regujarized flter 
with NOISEPOWER 
equal to 4. (b) The 
Same image 
Testored with 
NOISEPOWER equal 
to 0.4 and a RANGE 
DTTS=7 Te7 直 
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where g and PSF are from Example 5.7.The image was Improved somewhat 咎 

the original, but obviously this is not a particularly good value for NOISEPOWE 
After some experimenting with this parameter and parameter RANGE, we ari 
at the result in Fig.5.9(bj, which was obtained using the command 


efore,“*”indicates convolution， 下 1s the estimate of the undegraded 
g and both g and Pare as defined in Section 5.1. The iterative nature of 
lgorithm 1 evldent, [ts nonlinear nature arises from the division by 广 on 
cht side of the equation， 


人 thm js difficult to answer in general.TIhe apPproach often followed is to ob- 
the output and stop the algorithm when a resujt acceptable in a given 


Thus we see that we had to go down one order of magnitude on NOISEPOW EGG 全 全 记 秆 人 


and RANGE was tighter than the default. The Wiener filtering 1 sult 渤 
Fig. 5.8(d) is much better, but we obtained that result with full knowledge:g 
the noise and image Spectra. Without that inftormation, the results obtainabl 
by experimenting with the two filters often are comparable, 


芝 the restored image exhibits ringing introduced by the discrete Fourigi 
transform used in the algorithm, it usually helps to tse function edgetape 


(see Section 5.7) prior to calling deconvreg. ad function，NUMIT is the number of iterations (the default is 10), and 


AR and WETIGHT are defined as follows， 





Iterative Nonlinear Restoration Using the SS 
Lucy-Richardson Algorithm in the DAMPAR value from their original value. Thas SUPPresses noise gen- 

ation in Such pixels, Preserving necessary image details.The default is 0 (no 

mping/) 

WEIGHT is an array of the same size as g that asSSlgns a Weight to each Pixel 

reflect its quallty. For exampjle, a bad pixel resulting from a defective imag- 


The image restoration methods discussed iD the Previous three sections a 
hnear, They also are“direct”in the sense that, once the restoration filter 站 
specified, the solution is obtained via one application of the filter.This SImpjics 
ity of Implementation, coupled with modest computational requirements and 
a well-estabjished theoretical base, have made linear techniques a fundame 
tal tool in image restoration for many years. 
During the past two decades,nonlinear iterative technlques have been gain: 
Ing acceptance as Testoration tools that often ylield resujts Superior to those 滞 
obtained with linear methods. The principal objections to nonlinear methods 4 
are that their behavior is not always predictable and that they generally re- 及 上 
qulire significant computational resources, The first objection often loses ji 
Portance based on the fact that nonlinear methods have been shown to 
Superlor to linear techniques in a broad spectrum of appPlications (Jansso 
[1997]). The second objection has become less of an issue due to the dramat 
Increase in inexpensive computing power over the last decade. The nonlineaf 
method of choice in the toolbox is a technique developed by Richardson: 
[1972] and by Lucy [1974], working Idependentiy. The toolbox refers to this 
method as the Lucy-Richardson (L-R) algorithm, but we also see it quoted 过 
the literature as the Richardson-Lucy algorithm. 
The L-R algorithm arises from a maximum-likelihood formulation (see 
Section 5.10) in which the image is modeled with Poisson statistics. Maximlz- 
ing the likelihood function of the model yields an equation that is satisfied 
when the following iteration converges: 


IXejls according to the amount of flat-field correction that may be neces- 
HIYy based on knowledge of the imaging array When simujiating blurring with 


utation pixels that are on the border of an image and thus are blurred differ- 
ntly by the PSF If the PSF is of size  X 1 the border of zeros used in 
EIGHT is of width ceil(n/2).The default is a unit array of the same size as 
put image g 

“于 the restored image exhibits ringing introduced by the discrete Fourier 
ansform used in the aigorithm,it sometimes helps to use function edgetaper 
ee Section 5.7) prior to calling deconvlucy. 


Figure $.10(a) shows an image generated using the command 
六 f = checkerboard(8); 
Which produced a square imagsge of size 64 X 64 pixels. As before, the size of 


the image was increased to Size $12 x 512 for display purposes by USing func- 
ton pixeldup: 


Sb 


AR 
Js J 三 灰 ( | 一 一) Pt y)* 大 (z, 





>> imshow(pixeldup(f，8) ); 
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deconvlucy 


= 
人 
2 


下 和 AMPELE S.10: 
Using function 
deconvlLucy to 
restore a biurred、 
nolsy Image. 
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a b 

C d PSF = fspecial('gaussian'，7，10); 

e f 

FIGURE 5.10 人 人 人 ext we blurred image f using PDF and added to it Gaussian noise of zero 
(a) Original 和 和 and standard deviation of 0.01: 

Image. (b) Image 本 

blurred and 


SD = 0.01; 


9 imnoise(imfilter(f，PSF)，'gaussian'，0，SD^2) ; 


Gausslan nolise. 
(cj) through (站 
Imasge (b) 
restored using the 
L-R algorithm 
with 9$, 10. 20, and 
100 iterations、 
reSpectively. 


人 


= Cell(size(PSF，1)72) ; 
zerosS(SiIzel(g)); 
WEIGHT(LIM + 1:end -LIM，LIM + 1:end - LIM) = 1 


芝 
FT 
4 
7 
工 
于 
有 


和 
ge:Dixels are 1S. 


is 


JP = deconvliucy(g，PSF，NUMIT，DAMPAR，WEIGHT ) ; 
imshow(pixeldup(fFr，8)) 


下 or example, Fig, 5.10(f) was obtained using 100 iterations.This image is only 
全 Sughtly sharper and brighter than the result obtained using 20 jterations. The thin 
:Dack border seen in all resujts was caused by the 0s Im array WEIGHT. 问 


2ne of the most difficult problems in image restoration is obtaining a suitable es- 
nate of the PSF to use in restoration algorithms such as those discussed in the 
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[本 畏 Geometric Tansformations and Image Registration One of the most commonly used forms of spatial transformations is the 
. ， pool tratyjorma (Wolberg [1990j). Ihe affine transform can be wTitten in ma- 
We conclude this chapter with an introduction to geometric transformati0: 


for image Testoration. Geometric transformations modify the spatial rejation 


Ship between Pixels in an image. They are often called rppersjneel rap Di 和 
1aafioms because they may be viewed as printing an image on a sheet of rub zy7yH=ioz HUIT=lo z Ho to 
and then stretching this sheet according to a predefined set of rules， D31 52 


Geometric transformations are used frequentiy to perform iaage egi 
101, a Process that takes two images of the Same Scene and aligns the 
they can be merged for visuajlization, or for quantitative comparison. In 也: 
following Sections, we discuss (T) spatial transformations and how to defi 
and visualize them in MATLAB; (2) how to apply spatial transformationst 
Images; and (3) hbow to determine spatial transformations for use in ima 
Ieglstration. 


3 1. Geometric Spatial Transformations 


5uppose that an image, 太 , defined over a (tw, z) coordinate system， undergoe' 
geometric distortion to produce an image, 8, defined over an (xz, y) coordi 
System, [hls transformation (of the coordinates) may be exXpIessed as 


(0 


For exarmple, 讶 (x, y) = T{(w,z)} = (to/2,z/2), the“distortion” is simply 
shrinking of Aby half in both spatial dimensions, as ilustrated in Fig. 5.12. : 


Coordinate 
亚 quations 














t 
C> 


0 大 一 Sr 
0 人 


局 局、 
尼 .wm 


7 了 |[(35.2)}] = (2.5.1) 
cosoe Sing 0 X 一 10cC086 一 zSint 
?= 40Sinb 十 zcCOS0 





?0 Y 
ear (horizontal) 1 0 0 入 二 训 十 GZ 
人 y=:z 
0 0 
Shear {vertical) 1 8 0 天 二 划 
7 > 本 0 1 0 y=pBo0 十 Z 


FICURE 5.12 A simple spatial transformation,. (Note that the Yy-axes in this figure d 
noOt CorTespond to the image 3aXis coordinate system defined in Section 2.1.1， As 
Imentioned in that Section，IPT on occasion uses the so-called spatial coordinate 
System in Which ydesignates rows and * designates columns. This is the system Use 和 
throughonut this section in order to be consistent with IPT documentation on the toplc 
of geometric transformations.) 


X 三 40 十 站. 
和 三 ' 远 十 0 


习 刷 让 个 机 证 
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See 9eclioH3 2.70.06 
ad 77.77 Jorada- 
CILSSIOM OFSIFEECLHLTES. 





TABLE 35.3 
Jypes of aftine 
transtformations， 
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二 formfwd 


化 efminv 


》 coordinates of a transformed point. For example, the following co 



















Thbe first Input argument, transform_type, is one of these strings: 'affine 
projective `，'box'，'composite' ,or 'custom' .These transform tyPes 
described in Table 5.4，Section 5.11.3. Additional arguments depend on 
transform type and are described in detail in the help page for maketform， : 
jin thlis section our interest js on affine transfornas. For exampjle, one Way : 
create an aftine tformis to provide theTmatrix directly as 训 


>>T= [200;030;001]; 
>> tform = maketform( affine ，T) isoften useful to see how it transforms a set of points arranged on a grid.The 


tform = jowing M-function, Vistformfwd, constructs a grIid of points, transforms 

ndims_zizn: 2 grid using tformfwd, and then plots the grid and the transformed grid 

ndims_out: 2 

9 9 0 evSection 2.10.4) and Linspace (Section 2.8.1) for creating the grid. The fol- 
1InVverse_fcn: Qinv_affine 站 

tdata: [1 X 1 struct] 5 TS 

Although it is not necesSary to use the fields of the tform structure directi 

to be able to apply it,information about T, as well as about 工 -is coOntainediii 

the tdata field: 


tion Vistformfwd(tform，wdata，zdata，N) 

5VISTFORMFWD ViSsualize forward geometric transformm， 

“VISTFORMFWD (TFORM，WRANGE，ZRANGE，N) shows two plots: an N-by-N 
“gridg in the W-Z coordinate SyStem，and the Spatially transformed 
grid in the X-Y coordinate SyStem， WRANGE and ZRANGE are 
two-element vectors specifying the desired range for the grid. N 
-Can be omitted，in which case the default value is 10. 


>> 二 form.tdata 


ans = 
T: [3Xx3 doublLe] 
Tinv: [3 x3 double] 


>> 二 form,tdata .下 


margin < 4 
-RN = 10; 


ans = 8 
2 0 0 Create the WwW-z grid and transform it， 
0 3 0 WwW zj = meshgrid(1Iinspace(wdataf(1)，zdata(2)，N)，.,,， 
0 0 1 Linspace(wdata(1)，zdata(2)，N) ); 
>> tform,tdata.Tinv [w(:) zf(3:)]; 
ans = tformfwd([Iw(:) z(:)]，tform) ; 
0.5000 0 0 
0 0.3333 0 Calculate the minimum and maximum Values of W and x， 
0 0 1 .0000 Ss Well as zZ and yY，These are UsSed So the two plots can be 


displayed using the Same scale. 
_reshape(xy(:，1)，Size(wW)); 点 reshape is discussed in Sec，8.2.2， 
reshape(xy(;，2)，Size(z)); 

= [WwW()3 X(:)] 
= [fmin(wx) max(wWXx)]; 
[2 YY 人 (0 

= fmin(zy) max(zy)]; 

(Create the W-Z plot. 
bplot(1,2,1) 5 See Section 7.2.1 for a discussion of this function， 


IPTI provides two functions for applying a spatial transformation to 
polnts: tformfwd computes the forward transformation，7{(0, z)}， and 
tforminy computes the inverse transformation， 7 一 {(x, 放 } The callin8 
syntax for tformfwd is XY = tformfwd(WZ，tform). Here, WZisaPX2 
matrix of points; each row of WZ contains the W and z coordinates of one 
point. Similarly, XY isa 已 X2matrix of points;each row contains thexand 


人 】 

情 
取 
请 
六 十 
oo 


mands compute the forward transformation of a pair of points, followed 
the inverse transform to verify that we get back the original data: 


>> WZ = [11; 3 2]; 
>> XY tformfwd(WZ，tfornm) 
XY = 


XLim(wxlimits) 
ylim(zylimits) 
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ViLSstformfwd 
ee 
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set({(gca， 'XAxiSLocation'， :top') ab 
xlabel('w':)，ylabel1('z') 人 
己 上 
% CC - . 
reate thne Xx-y plot FIGURE 5 .13 
Subplot(1，2，2) Visualizing atfine 
plot(x，y， 'b' )，axis equal，axis 工 j transformations 
hojld 9 using gridS. 
pjlot(x ，yY' ， 'b ) (a) Grid 1 
hoJd of (5 
XJLIm(wXx1Limits) transformed using 
YJiml(zylLinmits ) tform1， 
Set(gca， XAXiSLocation'， 'top ') ， 
X1abel('x， label('y， 二 0 100 120 
4 (yY ) 人 transformed using 
tform2. 
也 XAMPLE 5.12: 朵 In this example we use vistformfwd to visuajlize the effect of severa 证 ， 
visualzing affine 。 ferent affine transforms. We also explore an alternate way to create an a 攻 
transforms usin ， . ， Se transformed using 
5 tform using maketform. We start with an affine transform that scales hori tform3. 


人 tally by a factor of 3 and vertically by a factor of 2: 


>> T1= [300;020;001]; 
>> tformt = maketform('affine'，T1); 
>> Vistformfwd(tformnfT，f[0 100]，[0 100] ) ; 





Figures 3.13(a) and (b) show the result. 
A shearing effect occurs when zb1 or is nonzero in the affine T mat 
SUch as 





ri 
100 -50 0 50 100 
人 ET 













>> T2= [100;) ,210;001]; 
>> tforn2 = maketform( ' affine'，T2); 
>> VIstformfwd(tformn2，[0 100]，[0 100]) 


Figures 5.13(c) and (d) show the effect of the shearing transform on a grid,， 

An interesting Property of affine transforms is that the composition of sev: 
eral afftine transforms is also an atfine transform. Mathematically, affine trans: 
tforms can be generated Simply by using mujitiplication of the T matrices. The: 
next block of code shows how to generate and visualize an affine transform 
that is a combination of scaling, rotation, and shear. 





>> Tscale = [1.5 0 0; 0 2 0; 0 0 1]; 
ee 1.2 Applying Spatial Transformations to Images 
-Sin(pii CosS(pi 
0 0 ost computational methodqs for spatially transforming an Image fall Into 


>> Jshear = [100; .210;001]; ne of two categories: methods that use fjorwarad rapping,and methods that 
>> J3 = Jscale * Trotation * Tshear， 

>> tform3 = maketform( ' affine'，T3) ; 

>> ViIstformfwd(tforn3，[0 100]，[0 100] ) 


el in turn, copying its value into the output Image at the location deter- 
ined by T{(w, z)}. One problem with the forward mapping procedure is 
that two or more different pixeis in the input image could be transformed 


Figures 9.13(e) and (fshow the results. 
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itransfornm 


EXAMPLE 5S.13: 
Spatially 
transforming 
Images. 


Chapter 5 姐 Image Restoration 

















combine mujtiple input pixel values into a single output pixel value， Anot 虹 
er potential Problem ls that some output pijxels may not be assigned a Value 
at alj. In a more Sophisticated form of forward mapping, the four corners 0 
each input pixel are mapped onto quadrilaterals in the output image. Input 
plxels are distributed among output pixels according to how much each ou 
Put Pixel is covered, relative to the area of each output pixel. Althoug 
more accurate, thls form of forward mapping is complex and computatio 
ally expensive to implement，. 
IPT function imtransform uses inverse mmapping instead. An inverse ma 
ping procedure scans each output pixel in turn, computes the cOITesSpondings 
location in the input image using 7 {f(xz, }，and interpolates among 也 
nearest Input Image pixels to determine the output Pixel value. Inverse map 
Ping is generally easier to implement than forward mapping. 
The basic calljing syntax for imtransform is 












g = imtransform(f，tform，interp) 


where interp is astring that specifies how input image pixels are interpolatedt 
obtain output Pixels; interp can be either 'nearest'，:'bilinear， ，0 
bicubic' .TIhe interp input argument can be omitted, in which case it defau 
to bilinear'. As with the restoration examples given earlier，functi 
checkerboard Is useful for generating test images for experimenting with spatia 
transtformations. 


嘲 In this example we use functions checkerboard and imtransform 
explore anumber of different aspects of transforming images.A Lizaear coH 
Jorral rrarsformmation is a type of affine transformation that Preserve: 
shapes and angles. Linear conformal transformations consjst of a scale fac 
tor, a rotation angle, and a translation. The affine transformation matrix 这 
this case has the form 


ycosD Sino 0 
工 =| 一 Sn ycosb 0 
0. 0， [ 


JTJhe following commands generate a linear conformal transformation and 
apply lt to a test Image. 


>> 千 checkerboard(50) ; 

>> S = 0.8; 

>> +theta = pi/6; 

>> [= [S*cos(theta) S*Sin(theta) 0 


-S*Slin(theta) S*cost(theta) 0 
0 0 1T] ; 
>> tform = maketform( ' affine'，T) ， 
>> g = inmtransform(f，tform) ; 


Figures 5.14(a) and (b) show the original and transformed checkerboard images， 
The Preceding call to imtransform used the default interpolation method; 
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ab 
Cd 
e， 
FIGURE 5.14 
Afftne 
transformations 
of the 
checKkerboard 
Image. 
(a) Original 
Image. (b) Linear 
conformal 
transformation 
using the default 
Interpojlation 
(bilinear). 
(c) Using nearest 
Delghbor 
inteTrpolation. 
(d) Specifying an 
alternate fi]] 
value. 
(e) Controlling 
the output space 
location So that 
transjlatlon 1S 
visible， 
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bilinear ,As mentioned earlier we can select a different interpola 
method，such as nearest neighbor，by specifying it explicity in the call 如 
imtransfornm: 


3 Image Registration 


ple， it may be of interest to align two or more images taken at 0 the 
me time, but using different instruments, such as an MRI (magnetic TeSo- 
ce imaging) scan and a PET (positron emission tomography) scan. Or, peT- 
sthbe images were taken at different times using the Same instrument, SUuch 


>> g2 = imtransform(f，tform，'nearest' ) ; 


Figure 5.14(c) shows the result. Nearest nejghbor Interpolation ; is faster 了 tb 到 
bilinear interpolation, and it may be more appropriate in some situatio apart. In either case, combining the images or performing Saiitaiiye analysis 
bnut lt generally produces results inferior to those obtained with biline and omparisons redquires ee for 人 aberTrationas 人 by 
Interpolation. 1 

Function imtransform has Several additional optional parameters that a 
Useful at times. For example, passing it a Fil1l1Value parameter controls the 
cojlor imtransform uses for pixels outside the domain of the input image: 


Ubject Position; and other factors- 
:The toolbox SUpports image Tegistration based on the use of Comtbyol Poirly， 


jnages are Known or can 国光 sejected interactively. Figure 9.1S jllustrates the idea 
control points using a test pattern and a version of the test pattern that has un- 
Tgone projective distortion, Once a Sufficient number of control points have 
en chosen, IPT function cp2tform can be used to fit a specified type of spatial 


>> g3 = imtransform(f，tform， "FillValue'，0.5)， 


jn Fig.3.14(d) the pixels outside the original image are mid-gray instead of biac 

Other extra parameters can help resolve a common source of confusion 
garding translating images Using imtransform. For example, the followin 
commands pertorm a Pure translation: 


>> [2= [100;)010; 5050 1]; 
>> tformn2 = maketform( 'affine'，T2); 
>> g4 = Imtransform( 了 ff，tform2) ; 


The result, however, would be identical to the original image in Fig, 5.14(a 
This effect is caused by defauit behavior of imtransform. Specificall 
imtransform determines the bounding box (see Section 11.4.1 for a definitio 
of the term pourding pox) of the output image in the output coordinate SW 
tem, and by detaujlt it only performs inverse mapping over that bounding bo 
Thls effectively xmcloes the translation,. By specifying the parameters XDat 
and YData, we can tell imtransform exactly where in output space to com 
pute the result.XData is a two-element Vector that Specifies the iocation of the: 
left and right columns of the output Image: YData is a two-element vector tha 
speclfles the location of the top and bottom rows of the output image.The fo 
lowing command computes the output image in the repgion betwee 
Wo y) = (1 1)and(x,y) = (400, 400). 


>> g5 = jimtransform(fT，tfornm2，XData'  ，[1 400]，'YData'，[1 400]，,，， 
上 IJlLValue' ，0,.5); 


Figure 9.14(e) shows the resujt. 

Other settings of imtransform and related IPT functions provide addition 
al control over the result, particularly over how interpolation is performed 
Meost of the relevant toolbox documentation is in the help pages for function 
imtpPansformand makeresamp1Ler， 


“二 丰 人 
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FIGURE 5.15 
Image reglstration 
based on control 
Points. 

(a) OTiginal image 
with control 
Points (the Smal] 
circles 
Superliimposed on 
the Image)， 

(b) Geometrically 
distorted Image 
with control 
Points. 

(c) Corrected 
Image Using 8 
Projectlve 
transformation 
inferred from the 
control Points. 
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TABLE 3.4 
Transformatlon 
types SuUpported 
by cp2tform and 
maketfornm. 













Transformation 
Type 


Atfine Combination of scaling,Trotation， maketfornm 
shearing, and translation. stralght cCp2tfornm 
lines remain Straight and parallej 
lines remaln parallel. 


也 Oox [Independent scaling and translatlon maketfornm 
along each dimenslon; a Subset 
of affine. 


Description Functions 





















CormPposite Acollection of spatiai maketfornm 
transformations that are appllied 
sequentially. 







2 
Custom User-defined SPatial transformi; maketfornm 2 
二 针 二 WE ， 1 
User Provides functions that define 有 


Tand T-L， 


Linear conformal Scaling (Same in adimenslions)， cp2tform 
rotation, and translation' a Subset 
of affine. 


LVWM Locai weighted meani; a locally- Cp2tfornm 
yarying spatial transformation. 























Piecewise linear Locally varying Spatial transforfmation， cp2tform 





ctlon of control points on a pair of images, Figure 5.16 shows a screen cap- 


Polynomial Input spatial coordinates are 3 Cp2tforpm 0 人 
of thls tool, which is invoked by the command cpselect. 


polynomial function of outpnut 
Spatial coordinates. 












Projective As with the affine transformation， maketforn 
straight lines remain StTaltght， cp2tfornm 
but parallel lnes converTge toward 
vanishing points. 





matertal in this chapter is a good overview of how MATLAB and [PT functions 
be used for image restoration, and how they can be used as the basis for generating 
让 els that help explain the degradation to which an Image has been subjected. The 
pabilities of IPT for noise generation were enhanced Significantly by the develop- 


名 ent in this chapter of functions imnoise2 and imnoise3,. Similarly th ial 下 
af Re :SImIaT yY the Spatial 站 |- 
transformation to the control Points (using least Squares techniques). TIhe sp 8 avyallable in function Spfilt, especially the nonlinear filters are 8 Significant 


transformation types Supported by cp2tform are listed in Iabjle 3.4. Rextension of IPT's capabilities in this area.These functions are perfect exampiles of how 

For example, let f denote the image in Fig, 5.15(a) and g the lmage ely simple it is to incorporate MATLAB and IPT functions into new code to cre- 
Fig. 5.15(b). The control point coordinates in f are (83, 81)，(450， 3 ppjications that enhance the capabilities of an already jarge set of existing tools. 
(43, 293),， (249, 392)，and (436, 442). The corresponding control point lo 
tions in g are (68, 66),，(375, 47),， (42, 286),，(275, 434)，and (523, 532)，TIhe 
the commands needed to align Image g to Image f are as follows: 















>> basepoints = [83 81; 450 56; 43 293j 249 392; 436 442] ; 
>> 1LInputpoints = [68 66; 375 47;j 42 286; 275 434; 523 532] 
>> tform = Cp2tform(InputpointSs，basepoints， projective ) | 
>> gp = imtransform(g，tform，'XData'，[1 502]，'YData'，[1 502]) 


Figure $.15(c) shows the transformed image. 
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FICGURE 5.16 
Interactive tool 
for choosing 
coOntrol points. 
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The three color Biue component Image 


components of 


| Green component Image 
a color Pixel. 


Red component Image 


ossible colors in an RGB image is (22 所 where p ls the number of bits in each 
omponent image. For the 8-bit case, the number 18 16,777,216 colors. 

Let fR, fG,and fB represent three RGB component images.An RGB image 
gs formed from these Images by Using the cat (concatenate) operator to stack 
Pryeziezl) he Images: 

In this chapter we discuss fundamentals of color image Processing Using 也 
Image Processing Toolbox and extend some of its functlonajlity by develop 
additional color generation and transformation functions. jhe discussion 
this chapter assumes familiarity on the Part of the reader with the Principl 
and terminology of coior Image Processing at an introductory levej. 


rgb _image = cat(3，fR，TfTG，TfB) 


Jhe order iD which the Images are pljaced in the operand matters. In general， 
at(dim，A1，A2，.， . .,) concatenates the arrays along the dimension Spec- 
出 ed by dim. For example,iidim = 1,the arrays are arranged vertically, 让 dim = 
;they are 3IITanged horlzontally and, 直 dim = 3,they are stacked in the third 
Imenslion, as in Fig. 6.1. 

革 al component images are identical, the result is a gray-scale image. Let 
gb_image denote an RGB image. TIhe following commands extract the three 
omponent 1mages: 





凡 Color Image Representation in MATLAB 


As noted in Section 2.6, the Image Processing Joolbox handies color Images 
either as indexed images Or RGB (red, green, blue) images. In this sectlon We 
discuss these two image types in Some detall. 


本 > fR = rgb_inage(:，:，1T1)) 
:1 了 (CrB Images 人 
AnRGB color iaasgeisanM XxNX3arrayofcolorpirels,where each colo > fB = rgb image(:，:，3); 


pixel is a triplet corresponding to the red, green, and blue components of 
RGB image at a specific spatial location (see Fig. 6.1).An RGB image mayb 
viewed as a “Stack”of three gray-scale images that, when fed into the I 
green,and blue inputs of a color monitor,Produce a color Image on the Screen 
By convention. the three images forming an RGB color Image are referred 
as the red, green,and blue compoPenmtinzdages. TIhe data class of the component 
images determines their range of values. If an RGB image is of class doubje， 
the range of values is [0,1].Similarly,the range of values is {f0,255] or [0， 6553 
for RGB images of class uint8 or Uint16, respectively. The number of bi 
used to represent the pixel values of the component images determines the 
cep 太 of an RGB image. For example, E each component Image 1S an 8-b 
image, the corresponding RGB image is said to be 24 bits deep. Generally， 外: 
number of bits in all component images is the same. In this case,the number 和 


TIhe RGB color space usually is shown graphically as an RGB color cube, as 
-depicted ip Fig. 6.2. The vertices of the cube are the Primary (red, green, and 
blue) and secomdary (cyan,magenta, and yellow) colors of light. 

Often, it is useful to be able to view the color cube from any Perspective. 
了 Function rgbcube is used for this Purpose,The syntax is 


rgbcube(vx，VvyY，YzZ) 


Jyping rgbcube(vx，Vvy，Yvz) at the prompt produces an RGB cube on the 
MATLAB desktop, viewed from point (Vx，Vvy，Vvz).The resulting image 
Scan be saved to disk using function print, discussed in Section 2.4. The code 
for this function follows. It is self-explanatory. 
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FICURE 0.1 
Schematic 
showing how 
plIxels of an RGB 
color imapge aTe 
formed from the 
coITTeSponding 
Pixels of the three 
component 
ImagesS. 





rgbcube 


rmiedtramsrcm。 
本 
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3a hb 


FICGURE 0.2 

(aj Schematic of 
the RGB coior 
cube Showing the 
Primary and 
Secondary colors of 
leght at the vertices. 
Poimts along the 
Imain diagonal have 
gray values from 
bliack at the origin 
to whblte at point 
(1 1,1).(b)The 
RGB color cube. 


rmcfior patch cre- 
al1esy 放 ilied 2-D poty- 
8OH3 based oF 
SPDeciFed 
BDropertwyaiae Pairs. 
Zor Piore 汝 如 rza- 
for aborx patch， 
5ee 轨 e AdA477L 4 
Pepp page 如 二 闻 
mctiort. 
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Blue |(0, 0,1) 


Magenta 





Yejlow 


function rgbcube(VvVX，VvyY，Vz) 
<RGBCU8E Displays an RGB cube on the MATLAB desktop， 


(VX，VY，YVZ) .With no input arguments ，RGBCUBE Uses (10，10，4) 
as the default viewing coordinates， To view individual color 
pJanes，USe the fol10wWing viewing coordinates，wWhere the first 


other Colors are as Seen from that axXis， proceeding to the Pight 
right (or above)，and then moving clockwise， 


人 


oaeeoeoiseesoododoup de ye op oo 


COLOR PLANE ( YX， VyY， VvzZ) 
BJLUe-WMagenta-White-Cyan ( 0， 0， 10) 
Red-Yel1ow-White-Magenta ( 10， 0， 0) 
Green-Cyan-White-Yel1ow ( 0，10， 0) 
BJack-Red-Magenta-B1ue ( 0，-!0， 0) 
BJLack-81Lue-Cyan-Green (-10， 0， 0) 
BJLack-Red-Yellow-Green ( 0， 0，-10) 


% Set up parameters for function patch， 

vertices_matrix = [000001010:011;10 0i101110111]|; 
faces_hmatrix = [1562;1375;1243;248 6;3784;5687]; 
COlors = Vertices matrix 

% The order of the cube vertices Was Selected to be the sanme as 

s the ordenr of the {R,G,B) colors (e.g.，(0,0,0) correSsponds to 

5 black，(1,1,1) corresponds to White，and so on,) 


5 Generate RGB cube Using function patch . 

patch( Vertices' ，Vvertices_matrix，':Faces' ， faces_matrix，，,，,， 
下 acevertexCData'，Ccolors，'FaceColor ' ， Interp'" ，,..， 
“EdgeAJpha' ，0) 


Set UP viewing point， 
if nargin == 0 
vX= 10;VvYyY = 10; vzZ = 4; 





RGBCUBE(VX，VY，VZ) displays an RGB color cube，viewed from polnt 


Color in the sequence is the cl1oSest to the viewing axis，and the 
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“elseif nargin ~= 3 
error( Wrong number of inputs. ') 


-axis off 
View([VX，VvY，YVvz]) 
.axis Square 5 


.1.2 Indexed Images 


An indexed Image has two components: a _ dat maatrix of integers X, and a 
colortQP 1aa1rrt map. Matrix map is an 17 X 3 array of class double containing 
floating-point values in the range [0, 二 .TIhe length, m, of the map is equal to the 
number of colors it defines. Each row of map Specifies the red, green, and blue 
components of asingle color.An indexed image uses“direct mapping”of PIXel in- 
tensity Values to colormap values. The color of each pixel is determined by using 


克 扩 ree colurangy oF 
map Cre Equal 太 eP 
太 e coforzap pe- 
coO11eyQa 8Tayscale 


the coITesponding yalue of integer matrix X as a pointer into map. 开 Xis of class map 
double,then all of its components with values less than or equal to 1 point to the 
frstTow In map, all components with value 2 point to the second row and so on . 开 
Xls of class uint8 or uint16, then all components with value 0 point to the first 
IOWin map, al components with value 1 point to the second row and so on.These 
concepts are 也 ustrated in Fig. 6.3. 
Jo display an indexed imapge we write 
>> Imshow(X，map) 
”or alternatively 
->> image(X) 
:>> Colormap(map) 
人 colormap is stored with an indexed image and is automatically loaded with 
the Image when function imread is used to load the 1Inage. 
FICURE 6.3 


Elements of an 
Indexed image， 
Note that the 
Value of an 
eljement of integer 
aITayYy 基 
determines the 
rOw nuraber 记 
the colormap. 
Each row conhtains 
an 人 RGB triplet， 
andzis the total 
TiUIber of rows，. 


2-D integer arTray,X 





Value of circled element = 天 
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二 mapprox 


-Whitebg 


TABLE 6.1 
RGB values of 


Scme basic colors, 


Jhe long or short 
names (enclosed 
by quotes) can be 
used instead of 
the numerical 
tripjet to SPpecjfy 
an RGB color. 





























Sometimes it is necessary to approximate an indexed image by one with ， 


Other colors in addition to the ones shown in Table 6.1 involve fractional val- 
tewer colors. For this we use function Imapprox, whose syntax is 


es. For instance,[.5 .5 .5]lSgray[.5 0 0]isdarkredand[.49 1 .83] 
s aqdquarmaarine. 
MATLAB Provides several predefined color maps，accessed using the 


ommand 


[Y，newmap] = imapprox(X，map， 门 ) 


Thbls function returns an array Y with colormap newmap, which has at mostn colors. 
The input array X can be ofclass uint8,uint16,or double.The output Yils of class 
uint8 过 n is less than or equal to 256. If n js greater than 256.Y is of class doub]1e, 

When the number of rows in map is less than the number of distinct integer val- 
ues In X, multiple values in X are displayed using the same color in map. For exam- 
ple, suppose that X consjsts of four vertical bands of equal width, with values 1, 64， 
128, and 256. If we specify the colormapmap = [0 0 0; 1 1 1],thenalltheel 
ements in X with value 1 would point to the first row (black) of the map and all the 
other elements would point to the second row (white), Thus, the command 
imshow(X，map) would display an image with a black band followed by three 
whlte bands. In fact, this woujid be true until the jength of the map became 65, at 
which time the display would be a black band followed by a gray band, followed 
by two white bands. Nonsensical image displays can result if the length of the map 
exceeds the allowed range of values of the elements of X. 


There are several ways to specify a color map. One appProach is to use the 
statement 


.>> COlormap(map_name ) 


Which sets the colormap to the matrix map_name; an exXample is 





.>> Colormap(tcopper ) 


where copper 1S8 one of the prespecified MATLAB colormaps. The colors jn 
this map vary smoothly from black to bright copper. Hthe last image displayed 
Was an indexed Imnage, thlis command changes its colormap to copper. Aljter- 
natively the image can be displayed directly with the desired colormap: 


>> imshow(X，copper) 


Table 0.2 lists some of the colormaps available in MATLAB.The length (number 
of colors) of these colormaps can be specified by enclosing the number in baren- 
theses. For example,gray(16) generates a colormap with 16 shades of gray， 


>> map(k，:) = fr(k k) bf(k : 
| ee 0.1.3 IPT Functions for Manipulating RGB and Indexed Images 


Jable 6.3 jlists the IPT functions suitable for converting between 有 RGB, in- 
dexed, and gray-Scale images. For clarity of notation in this section，we use 
rgb_image to denote RGB images, gray_jimage to denote gray-scale images， 
“bw to denote black and white images, and X,to denote the data matrix compo- 
_nent of indexed images. Recall that an indexed image is composed of an inte- 
”ger data matrix and a colormap matrix. 

Function dither is appjicable both to gray-scale and color images. Dither- 
-ing is a process used mostiy in the printing and publishing industry to give tbe 
Visual impression of shade variations on a printed page that consists of dots. In 
the case of gray-scale images, dithering attempts to capture Shades of gray by 
-producing abinary image of black dots on a white background (or vice verISa)， 
The sizes of the dots vary from small dots in light areas to increasingly larger 
_dots for dark areas. The key issue in implementing a dithering algorithm is a 
fradeoff between “accuracy”of visual perception and computational complex- 
ity.The dithering approach used in IPT is based on the Floyd-Steinberg algo- 
Tithm (see Floyd and Steinberg [1975], and Ulichney [1987]). The syntax Used 

by function dither for gray-Scale images is 


where [r(k) g(kK) bf(k)] are RGB values that Specify one IOow of a col- 
ormap, Ihe map is filled out by varying kK. 

Jable 6.1 lists the RGB values for some basic colors. Any of the three for- 
mats Shown in the table can be used to specify colors. For exampjle, the bacK- 
ground color of a figure can be changed to green by using any of the following 
three Statements: 


>> Whitebg( g') 
>> Whitebg( green ' ) 
>> Whitebg([0 1 0]) 


Long name Short name RGB yalues 


81Lack 
BJLue 
Green 
CyYann 
Red 
Magenta 
YeJllLoW 
White 


bw = dither(gray_image ) 


之 3 瑟 站 口 站 厅 


where, as noted earlier, gray_image is a gray-scale imasge and bw is the 
-dithered result (a binary image). 
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:When working with color images, dithering is used pri >ipally in conjunc- 











































































N Description 

Some of the on with function rgb2ind to reduce the number of colors in an image. This 

MATLAB autumn Varies Smoothly from red, through orange, to yellow- function is discussed later in this section 

predefined bone A gray-sScale colormap with a higher value for the blue component. _Function graysLice has the syntax 

colormaps. This colormap ls useful for adding an “electronlic” look to gray- 

scaje images. 坟 
colorcube Contalns as Imany regularly spaced colors in RGB color space aS Se 
Possibjle, while attermpting to PIovide more steps of gray, pure red， . . 
pure green, and pure biue. This function Produces an Indexed image by thresholding gray_image with 
co01 Consists of coiors that are Shades of cyan and imagenta. It Varies threshold Values 
smoothly from cyan to magenta. 1 2 可 
coppenr Varies Smoothly from black to bright copPer. ee 
fl1ag Consists of the colors red, white, bjue, and black.Thbis colormap 好 天 有 
lor Wi indeXx 1; t. ， 
gray we 0 ee As noted earjlier the resujting indexed image can be viewed with the com- 
hot Varies smoothjy from black, through shades of red, orange, and Imand LTmShow(X， map) using a map of apProPTIlate length [e.g., jet(16)].An 
yellow, to white. alterriate SyntaxX 1S 
hsv Varies the hue component of the hue-saturation-value color 
model.The colors begin with red, pass through yellow, green, cyan， X = graysSlice(gray image，YV) 
blue, magenta, and return to Ted. The colormap is Particularly 
appPIopIiate for displaying Penodic functions， Where v i a vector whose values are used to threshold gray_image. When 
jet 二 全 red, and passes through the colors cyan， used in conjunction with a colormap, grayslLice is abasic tool for pseudocol- 
ow, and orT ， .。 ， 。 
人 se ee Re 和 or image Processing, Where specified gray intensity bands are assigned differ- 
property and ashade of gray Consult online help regarding ent colors. The Inpnut Image can be of class uUint8， uint1 6， orT double， The 
fn 二 的 EGREEE threshold vajlues in v must between 0 and l, even 二 the input image is of class 
pink Conittains pastel shades of pink.The pink colormap Provides sepia ulntg 人 uiEnt16. The functton performs the necessary Scajling. 
tone cojlorization of grayscale photographs. Function gray2ind, with syntax 
Prism Repeats the slx colors red, orange,yellow, greemn, blue, and violet， 
spring Conslsts of colors that are shades of magenta and yellow. [X，map]l = gray2ind(gray image，n) La gFay2ind 
Summenr Consists of colors that are Shades of green and yejllow. 人 
ee 0 ee cales, then rounds image gray_image to produce an indexed imagpe X with 
中 -colormap gray(n),Itnis omitted,it defaults to 64.The input image can be of 
class uint8, uint16,or double.The class of the output image Xis uint8ifnis 
jess than or equal to 256, or of class uint16 让 n is greater than 256. 
_Eunction ind2gray, with the syntax 
gray_image = ind2gray(X，map) 

im pme | 

IPT functions for 上 -converts an indexed image, composed of X and map, to a gray-scale 1mapge, 

converting dither Creates an indexed Image from an RGB image by dithering. Array Xcan be of class uint8, uint16,or doub1le.The output jimage ls of class 

between RGB， graySlice Creates an indexed Image fom a gray-scale intensity image by :doub1e， 

indexed, and gray- mujtilevel thresholding， jhe syntax of interest in this chapter for function rgb2ind has the form 

Scale lntensity gray2ind Creates an indexed image from a gray-Scaje intensity image. 

Images. ind2gray Creates a gray-Scajle intensity image fom an indexed image. _ 了 
rgb2ind CrIeates an indexed image from an RGB image， [map] = rgb2ind(rgb_image，n，dither_option) 922nd 
ind2rgb Creates an RGB image from an indexed image， wh 
hgb2gray Creates a gray-scale image from an RGB image， ere n determines the length (number of colors) of map, and dither_option 





Can have one of two values: 'dither' (the defaulb dithers, 让 DecessSary to 
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gbagray 


上 EXAMPELE 6.1 
Hiustration of 
Some of the 
functions in 


Tabjle 6.3， 
















achieve better color resolution at the expense of Spatial resolutlon; CODVeISely， 
nodither maps each color In the original image to the closest colior in the new 
map (depending on the value of n). No dithering is performed. The input image 
can be of class uint8,Uint16,or dqouble, TIhe output array, X, ls of class Uint8 让 
n ls less than or equal to 256; otherwise it 1 of class uint16. Example 0.1 shows 
the eftiect that dithering has on color reduction. 

Function ind2rgb, with Syntax 


rgb_image = ind2rgb(X，map) 


converts the matrlx X and corresponding colormap map to RGB format; X can 
be of class uint8，uint16，oT doub1e、Jhe output RGB image is an 
必 xANx3arrayofcass doub1le. 

Finally function rgb2gray, with Syntax 


gray_image = rgb2gray(rgb jimage ) 


convetts an RGB image to a gray-scale image. Ihe input RGB Image can be of 
class Uint8, uint16, or doublei the output Image ls of the Same class as the: 
inpnut， 





Function rgb2ind is qulte useful for reducing the number of colors im an: 
RGB image.As an illustration of this function, and of the advantages of using 
the dithering option，consider Fig, 6.4(a)，which ls a 24-bit RGB image, 下 
Figures 6.4(b) and (c) show the results of using the commands : 


>> [X1，map1] = rgb2ind(f，8，'nodither ') ， 
>> Imshow(X{+，mapl) 


and 


>> [X2，map2] = rgb2ind(f，8， 'dither  ) ; 
>> figure，jimshow(X2，map2) 


Both images have only 8 colors, which is a significant reduction in the number 
of possibjle colors in f, which, for a 24-blit RGB image exceeds 16 mijlllon, 3S 
mentioned earfier. Figure 6.4(b) has noticeable false contouring, especially 1 
the center of the large flower. The dithered image Shows better tonality, and 
considerably less false contouring, a result of the“Tandomness”introduced by 
dithering. The Image is a little blurred, but it certainly 1$ visually SuperIlior to 
Fig. 6.4(b). 

The eftects of dithering are usualjy better lllustrated with gray-Scale 1mages. 
Figures 0.4(d) and (e) were obtalined using the commands 


>> 9 = rgb2gray(T) ; 
>> g1 = dither(g) |; 
>> figure，limnshow(g); figure，1Imshow(g1 ) 
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有 
b <c 
d e 


FIGURE 6.4 

(aj) RGB image. 
(bj Number of 
cojors reduced 

to 8 withonut 
dithering. 

(c) Number of 
colors reduced to 
8 with dithering. 
(d) Gray-scale 
version of (a) 
obtalined using 
function 
rgb2gravy. 

(e) Dithered gray- 
Scale Image (thils 
1S a binary image). 
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rgbantse 

































The Image in Flg. 0.4(e) is a binary image, which again represents a significan 尺 1.000 ”0.956 0.621 | 忒 
degree of data reduction. By looking at Figs. 6.4(c) and (ej, it is clear wh Cj =|1.000 -0.272 -0.647 | 7 
dithering is such a staple in the Printing and publishing industry, especially 刀 1.000 -1106 1703 || e 


Situations (Such as in newspapers) where paper quality and printing resotutig 


are low. TPTfunction ntsc2rgb implements this equation: 


rgb_image = ntSc2rgb(yiq_image ) 





Convertting to Other Color Spaces 


As explained in the Previous section, the toolbox represents colors as RGB val 习 :Both the Input and output images are of class doub1e. 
ues, directly ip an RGB image, or indirectly in an indexed image, where the col 

ormap 1 stored in RGB format, However there are other color spaces (al 
called coior modelks) whose use iD Some applications may be more convenien 
and/or appropriate. IJhese include the NTSC, YCbCr, HSV CMY CMYK, and 
了 LI color spaces. The toolbox Provides conversion functions from RGB to ths 
NTNSCYCbCr HSV and CMY color spaces, and back. Functions for convertin 
to and from the HSI color space are developed later in this section， 


The YCbCr color space ls used widely in digital video. In this format, luminance 
information is represented by a single component Y and color information is 
ed as two colordifierence components, Cb and Cr. Component Cb is the d 计 - 
feirence between the blue component and a reference value,and component Cris 
fhe difference between the red component and a refereace value (Poynton 
了 4996])， Jhe transformation used by IPT to convert ffom RGB to YCbCr is 


他 
0.4.1 NTSC Color Space 7 16 65.481 ”128.553 ”24.966]R 
The NTSC color system is used in television in the United States. One of the CD |=|1128 |+| -37.797 -74203 112000 ce 
malin advantages of this format is that gray-scale information is separate fro Cr 128 112.000 -93.786 -18214 || 


color data, So the Same Signal can be used for both color and monochro 
television sets. In the NTSC format, image data consists of three compPonents: 
LU1U1G11Ce (Y), Pae (Tandsattration (Q), where the choice of the letters YIG 
is conventional. The luminance component represents gray-scale informatio 
and the other two components carry the color information of aTV Signal.The: 
YICQ components are obtained from the RGB components of an image USing 
the transformation 


The conversion function js 
ycbcr_image = rgb2ycbcr(rgb image) 


The input RGB image can be of class uint8, uint16, or double.The outpnut 
mage js of the same class as the input.A similar transformation converts 他 om 
CCbCr back to RGB: 


了 0.299 ”0.$87 0.I14 | 民 
! |= | 0.396 -0.274 一 0.322 | G rgb_image = ycbcr2rgb(ycbcr_image) 
他 2 2 2 小 殖 


DThe input YCbCr image can be of class Uint8,uUint16,or doub1e.The outpnut 
Note that the elements of the first row sum to 1 and the elements of the ne Image is of the same class as the Inpnut， 
fwo IOwS Sum to 0. Thls il8 as expected because for a gray-scale image all 节 

RGB components are equal so the fand Q components should be 0 for suc 


an image.FEunction rgb2ntsc performs the transformation: 


?2.3 The HSV Color Space 


HSV (hue, saturation ， Value) is one of several color Systems used by people to 
seject colors (e.g., of Paints or inks) from a color wheel or palette. This color 
System is consjiderably closer than the RGB System to the way in which hu- 
JInans experience and describe color sensations. In artist's terminology hue， 
Saturation, and value refer approximately to tint, Shade, and tone， 

:JIhe HSV color space is formulated by looking at the RGB color cube along 
lts gray axis (the axis joining the black and white vertices), which resujts in the 
iexagonally shaped color palette shown in Fig. 6.5(a). As we move along the 
Yertical (gray) axis in Fig, 6.5(bj, the size of the hexagonal plane that is perpen- 
dicular to the axis changes, yielding the volume depicted in the figure. Hue is 


YIq_image = rgb2ntsc(rgb_image) 


where the Input RGB image can be of class uint8, uint16, or double,. 了 
output image is an M XxNX3array of class doub1le. Component imag 
YIq_image(:，:，1) istheluminance, yiq_image(:，:，2) isthe hue,an 
yiq_image(:，:，3) is the saturation imapge. 

>imilarly, the RGB components are obtained from the YIQ component 
Using the transformation: 
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a b 
FIGURE 6.5 
(a) The HSYV 


color hexagon. 
(b) TIhe 和 SSY 


hexagonal cone. 





- rgb2 hsv 












































Most devices that deposit colored Pigments on paper, Such as color pIinterTs 
and copiers, require CMY data input or perform an RGB to CMY conyversion 
internally. This conversion is Performed using the Simple equation 


Cyan C | 展 
MI=|11 = 一 1C 
下 妇 


here the assumption js that all color values have been normallized to the range 
]] .This equation demonstrates that light reflected from a Surface coated with 
Ure cyan does not contain red (tbat is C = 工 一 丸和 the equation), Similarly， 
ure mapgenta does not reflect green, and pure yellow does not reflect blue. Ifhe 
receding equation also Shows that RGB values can be obtained easlly from a set 
of CMY values by subtracting the Individual CMY values from 1|. 

_In theory equal amounts of the pigment primaries, cyan, magenta, and yel- 
wshould produce black. In practice, combining these colors for PITinting Pro- 
ces a mnuddy-looking black. So, in order to produce true black (which ls the 
redominant color in Printing),a fourth color, piack,is added,giving rise to the 
MYK color model. Thus, when publishers talk about“four-color printing，” 
iey are Teferring to the three-colors of the CMY color modejl plus black. 

_ Function imcomplement introduced in Section 3.2.1 can be used to convert 
om RGB to CMTY: 


ve 





expressed as an angjie around a color hexagon,typicaliy using the red axis as the 
0? axis.Value is measured along the axis of the cone,JhbheY = 0end of the axis 
is black.TheyY = 1 end of the axis is white, which lies In the center of the 纪 
color hexagon in 下 g. 6.5(a). Thus, this axis Tepiesents all shades of 芷 ay Satura- 
tion (Purity of the color) is measured as the distance from the Y axls. 

The HSV color system is based on cylindrical coordinates. Convyerting from 
RGB to HSV is simply a matter of developing the equations to map RGB val- 
ues (which are in Cartesian coordinates) to cylindrical coordinates. This topic 
is treated in detail in most texts on comptuter graphics (e.g.,See Rogers [1997]) 
so we do not develop the equations here， 

The MATLAB function for converting from RGB to HSVY is rgb2hsyv 
whose SyntaxX 1S 


cmy_jimage = imcompJenment (rgb_inagel) 
re use this function also to convert a CMY image to RGB: 


rgb_image = imcomplLlement (Cnmy_image ) 





4.2.5 The HSI Color Space 


With the exception of HSV, the color Spaces discussed thus far ar” not well 
Suited for descripipng colors in termas that are Practical for human InterpPreta- 
on. For example, one does not refer to the color of an automobile by gliving 
the percentage of each of the Pigment primaries composing its cojor. 

_ When humans view a color object, we tend to describe lt by lts hue, SaturTa- 
tion, and brightness. Bre is an attribute that describes a Pure color (e.g., Pure 
yellow orange, or red), whereas Saluratiom gives a Imeasure of the degree to 
Whbich a pure color is diluted by white light. 召 rigjtness is a Subjective descrip- 
tor that is practically impossible to measure. It embodies the achromatlc Do- 
ton of intenasiby and js a key factor in describing color Sensation. We do know 
that intensity (gray level) is a most Useful descriptor of monochromatic im- 
ages.This quantity definijtely is measurable and easjly interpretable. 

The color space we are about to present, called the 已 S7 (hue, saturation, iD- 
tensity) color space, decouples the intensity component from the color-carTying 
Information (hue and saturation) in a color ; Image. As aresult,the HS9Imodel ls 
an ideal tool for developing image-processing algorithms based on coilor 
descriptions that are_ natural and intuitive to humans who, after all are the 
developers and users of these algorithms. The HSV color Space is SomewWhat 


hsv_image = rgb2hsv(rgb_image) 


The InPut RGB image can be of class uint8, uint16, or doub1e; the output : 
image is of class double.The function for converting from HSV back to RGB 
is hSv2rghb: 


rgb _ image = hsv2rgb(hSsv_image ) 


The input image must be of class doub1le. The output also is of cjass doubj1e 


.4.4 The CMY and CMYK Color 9paces 


Cyan, magenta, and yellow are the secondaTy colors of ]ight of， alternatively 
the Primary colors of pigments. For example, when a Surface coated with cyan : 
pigment is illuminated with white light, no red light is reflected from the SuT 
face, That is, the cyan pigment subtracts red light from reflected whlte light， 
which itself is composed of equal amounts of red, green,and blue lght， 
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FIGURE 6.6 

及 Relationship 
between the RGB 
and HSI color 
models. 











similar, but its focus is on presenting colors that are meaningful when interpref Green Yellow 


ed in termns of a color artist's Palette. 

As discussed in Section 6.1.1, an RGB color image is composed of thre 
monochrome intensity images, So it should come as no Surprise that we shonul 
be able to extract intensity from an RGB image.This becomes quite clear 这 w 
take the color cube from Fig. 6.2 and stand it on the black， (0, 0,0)，vVertex 
with the whbite vertex, (1, 1, 1)， directiy above it, as Fig. 6.6(a) Shows.As noted 
In connection with Fig. 6.2,the intensity is along the line joining these two ver 
tlces. In the arrangement shown in Fig. 6.6,the line (intensity axis) joining the 
black and white vertices is vertical. Thus, 让 we wanted to determine the inten: 
Sity component of any color point in Fig. 6.6, we would simply pass a Plane 
PerPpeadicuiar to the intensity axis and containing the color point. The inte 
Sectlon of the plane with the intensity axis would give us an intenslty value 雇 
the range [0,1]. We also note with a little thought that the saturation (purity) 
of a color increases as a function of distance from the intensity axis. In fact， the 
Saturation of points on the intensity axis is zero, as evidenced by the fact that 
all points along this axis are gray. 

IJn order to see how hue can be determined from a given RGB point, con- 
slder Fig. 6.6(b), which shows a plane defined by three points, (black, white: 
and cyan). The fact that the black and white points are contained in the plane 
tells us that the intensity axis also is contained in the plane. Furthermore, we 
see that 41 points contained in the plane segment defined by the Intensity 3aXis 
and the boundaries of the cube have the samaze hue (cyan in this case). This is 
because the colors inside a color triangle are various combinations or mjxtures 
of the three vertex colors. If two of those vertices are black and white, and the 
third is a color point, all points on the triangle must have the same hue since 
the black and white components do not contribute to changes in hue (of 
course, the intensity and saturation of points in this triangle do change). By ro- 
tating the Shaded plane about the vertical intensity axis, we would obtain di 
terent hues. From fthese concepts we artive at the conclusion that the hue， 
saturation, and intensity values required to form the HSI space can be ob- 
talined from the RGB color cube.That is, we can convert any RGB point to a 
coOITesSponding point is the HSI color model by working out the geometrical 
formulas describing the reasoning just outlined in the Preceding discussion， 


Cyan Red 


Blue Magenta 


Green Yellow Green Yeliow 人 GE 


Red Cyan Red Cyan Yeliow 





Blue Magenta Biue Magenta Blue Magenta Red 































.Based on the Preceding discussion, we See that the HSI space consists of a 
rtical intensity axis and the locus of color points that lie on a plane perpen- 
iicular to this axis. As the plane moves up and down the intensity axis, the 
boundaries defined by the intersection of the plane with the faces of the cube 
have either a triangular or hexagonal shape.This can be visualized more read- 
jy by looking at the cube down its gray-scale aXis, as Shown in Fig, 6.7(a). In 
this plane we see that the primary colors are Separated by 120”. The secondary 
”colors are 60" from the primaries, which means that the angle between Sec- 
ondary colors also is 120。, 

、，Figure 6.7(b) shows the hexagonal shape and an arbitrary color point 
shown as a dot).The hue of the point ls determined by an angle from some 
reference point. Usually (but not always) an angle of 0” from the red axis des- 
gnDates 0 hue, and the bue increases counterclockwise from there. The satura- 
ion (distance from the vertical axis) is the length of the vector from the Origin 
0 the point, Note that the origin is defined by the intersection of the coler 
plane with the vertical intensity axis. The important coImponents of the HSI 
olor space are the vertical intensity axis, the length of the vector to a color 
oint, and the angle this vector makes with the red axis.Therefore ,it is not un- 
suajl to see the HSI plane defined is terms of the hexagon just discussed, a tri- 
ngle, or eyen a circle, as Figs. 6.7(c) and (d) show. The shape chosen is not 
Important because any one of these shapes can be warped into one of the 
other two by a geometric transformation, Figure 6.8 shows the HSI model 
based on color triangles and also on circles， 


Converting Colors from RGB to HSIi 


了 the following discussion we give the RGB to HSI conversion equations 
Without derivation, See the book Web site (the address is listed in Section 1.5) 
fora detailed derivation of these equations. Given an image in RGB color for- 
mat, the 妃 component of each RGB pixel is obtained Using the equation 
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FICURE 6.7 Hue and 
saturation in the HSI 
color model.The dot 
ls an arbitrary color 
point. Ihe angje from 
the red axis gives the 
hue,and the length of 
the vectorls the 
Saturation. TIhe 
intensity of all cojors 
in any of these planes 
is given by the 
Posltion of the plane 
on the vertical 
intensity axjls. 
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FICURE 6.8 The 
HS9SI color modej 
based on (al) 
triangular and (bb) 
circular cojor 
Planes. The 
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Clrcjes 3aTe 
perpendicular to 
the vertical 
intenslty aXls. 











7 三 0.75 


White 


人 


Black 


Black 








SS 


Red 








ee 可 ee rar Te 四。 Yerr。 mr Ar mw 


Red 


Magenta 


ay we 

















革 [(R-G)+(R 一 万 )] 


人 


晴 一 COS 


e saturation component 1$ given by 


9 = 三 1 一 人 C, 万 )] 


Finally, the intensity component 1S glVen by 
7 三 5(R 下 克 寺 下 ) 


assumed that the RGB values have been normalized to the range [0， 也 ， 
andthat angle 9is measured with respect to the red axis of the 于 SI space, as 10- 
jicated in Fig. 6.7.Hue can be normalized to the range [0,1] by dividing by 300- 
values resulting from the equation for 五.TIhe other two HSI components al- 
eady are in this range 这 the given RGB values are in the interval [0, 吉 . 


Converting Colors from 再 SI to RGB 


Given values of HSI in the interval [0,1j,we now find the corresponding RGB 
yalues in the same Iange.The applicable equations depend on the values of 忌 . 
There are three sectors of interest, corresponding to the 120?” intervals In tbe 
separation of primaries (see Fig. 6.7). We begin by multiplying 1 by 300 ， 
which returns the hue to its original range of [0?, 360"]. 


RG Seclor (0?" 三 囊 < 120?): When 万 is in this sector the RGB components 
are given by the equations 
= 7(1 一 号 ) 


9 CoOS 五 
cos( 60?” 一 五 ) 


杞 | 
| 


本 
and 

CC=37 一 (R+ 也 ) 

GB sector (120" 三 媚 < 240?): 开 the given value of 如 is in this sector， We 
.hrst subtract 120? from it: 

百 = 玖 一 120” 

Then the RGB components are 

7(1 一 9) 


汪 9S cos 瑟 
cos(60? 一 五 ) 


玉 


C= 了 
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rgb2hsi 


睛 一 一 















den = sqrt((r 一 g).*2 + (rr 一 bl.*(g 一 b)); 


and 县 
theta = acos{(num./(den + eps) ) ; 


有 =37-(R+OGI 


BR sector (240"” 三 六 三 360?): Finaly, 让 豆 is in this range, we subtract 240* > gj) = 2*pi 一 Hb > 9); 


from it: H/(2x*pi) ; 
玉 = 互 --240? = min(mintr， 9)，b) ; 
=Pr+a 十 b ; 
Then the RGB components are en(den == 0) = eps; 
G = 7(l - 9) 1 一 3.* num./den; 
一 一 一 0 ; 
9cos 吾 
cos(60* - 万 ) (站 +g+b)73i 


:Combine al1 three results into an hsi image， 


and si = cat(3，H，S，I) ; 一 一 
R=37 一 (GT+DD) 
Use of these equations for image processing is discussed later in this chapter An M-function for Converting from HSIto RGB 


The following function， 
An Mi-function for Converting from RGB to HSI 


The following function， rgb = hsi2rgb(hsi) 


implements the equations for conyverting from HSI to RGB. The documenta- 
on ln tbe code detalls the use of this function. 


hsi = Prgb2hsi(rgb) 


Implements the equations just discussed for converting from RGB to HSI. To. 
SIImplify the notation, we use rgb and hsi to denote RGB and HSI ij images,Tre-- 
spectively. The documentation in the code details the use of this function 


function rgb = hsi2rgb(hsi) 
%HSI2RGB Converts an HSI jimage to RGB . 
RGB = HSI2RGB(HSI) converts an HSI image to RGB，where HSI 
is asSsunmed to be of class doub1le with': 

hsi(:;，:，1) = hue image，asSSumed to be in the range 

[0，1j by having been divided by 2x*pi， 

，2) = Saturation image，in the range [0，1]， 
，3) = intensity image，jin the range [0，1+]， 


function hsi = rgb2hsifrgb) 

5%5RGB2HSI Converts an RGB image to HSI， 

HSI = RGB2HSI(RG8) converts an RGB image to HSI，The input Image 
1S asSSumed to be of size M-by-N-by-3，where the third dimension 
accounts for three image planes: red，green，and blue，in that 
order，If all RGB component images are equaLl，the HSI conversion 
is undefined，The input image can be of Class double (With values 


hsl1t( :， 
站 SS 


The components of the output image are: 


in the range [0，1])，uint8，or uint16， rgb(:，:，1) = red， 
rgb(:，:，2) = green， 
rgb(:，:，9) = blue， 


The output image，HSI，lis of class double，where: 


cp ooaoeopaoeoo de ce oo 风 


hsi(:，:，1) = hue image normalized to the range [0，1] by Extract the individual HSI component images. 
dividing all1 angle Values by 2*pi， = hsi(:，:，1) * 2 * pii 
hsi(:，:，2) = Saturation image，in the range 鸭 寺 十 > 9 
各 hsi(:，:，9) = intensity image，in the range [0，1]， 从 


ImplLement the conversion equations ， 

= zeros(Size(hsi，1)，sSizefhSsi，2)) 

= Zeros(Slize(hsli，1)，Size(hsi，2)); 
) 


% EXxtract the individual component immages， 
rgb = im2double(rgb)， 


1 
9 ED 日 (二 = zeros(SLze(lhsi，1)，sSizeflhsi，2)); 
TD 


5 RG sector (0 <= H < 2x*pi/3) ， 
dx = find( (0 <= H) & (H < 2x*pi/3) ) 
(idx) = I(idx) .* (1 一 S(idx)); 


oo 


Implement the conversion equations . 
num=0.5S*(Pr 一 g)+(r 一 b)); 


LD 
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from 0?” to 360?” (ie., from the lowest to highest possible values of hue). Ihis 
s precisely what Fig 6.9(a) Shows because the lowest value ls Tepresented as 
black and the highest value as whlte im the gure， 

_Tphe saturation image in Fig. 0.9(b)j shows Progressively darker values to- 
ward the white vertex of the RGB cube, indicating that colors becorme less and 
ess Saturated as they appProach white. Finally，every Pixel in the intenslty 


R(idx) = IUidx) .* (1 + S(idx) .* cos(H(idx)) ./ 
s(pi/3 -HLidx))) 


| 


G(IdXx) 3*TI(idx) 一 (R(idx) + BiIdxh) 

% BG Sector (2*pi/3 <= H < 4*pi/13)， 

idx = find( (2*pi/3 <= H) & (H < 4*pi/13) ) 
R(idx) = TI(idx) ,* (1 一 S(idx)); 


G(idx) = I(idx) .* (1 + S(idx) .* cos(H(idx) - 2*pi/3) ，/ image Shown iD Fig. 0.9(c) js the average of the RGB values at the corre- 
cos (PiI 一 H(idx))); ponding Pixei im Fig. 0.2(b). Note that the background in thls Image 18 whlte 
B(idx) = 3*TI(idx) 一 (R(idx) + G(Idx)); because the intensity of the background in the color image is white. It is black 


% BR_ sector， in the other two images because the hue and Saturation of white are Zero. 疆 


idx = findf (4*pi/3 <= H) & (H <= 2*pI)) 

G(idx) = I(idx) .* (1 一 Sidx)); 

B(idx) = I(idx) .* (1 + S(idx) .* Cos(H(idx) 一 4*pli/3) ./ .……， 
CosS(5*pi/3 一 H(idx))); 


The Basics of Color Image Processing 


上 


n this section we begin the study of processing techniques appPjlcable to cojlor 
images. Although they are far from being exhaustive, the techniques devel- 
oped in the sections that follow are illustrative of how color images are han- 
dled for a variety of image-processing tasks. For the purposes of the following 
discussion we Subdivide color image Processing into three Principal areas: 
1)colortransforraatiozs (also called color mappin8sh(2) SPQLIGL ProcesSI28 of 
individual color pljanes; and (3) color vector processi118. The first category deajls 
with processing the Pixejls of each color Plane based strictly on their values and 
ot on their spatial coordinates. This category is analogous to the materiajl in 
ection 3.2 dealing with intensity transformations. Ihe second category deals 
with spatial (neighborhood) filtering of idividual color Planes and is analo- 
ous to the discussion in Sections 3.4 and 3.3 on spatial fjtering. 

、 The third category deals with techniques based on pIocessing all compo- 
ents of a color image Simultaneously Because full-color images have at least 
hree components, color pixels really are vectors. For example, in the RGB Sys- 
em, each color point can be interpreted as a Vector extending from the origin 
o that point in the RGB coordinate System (See Fig. 6.2). 

Let ec represent an atfbitrary vector In RGB color space: 


| 





R(idx) = 3x*TIT(idx) 一 (G(Iidx) + B(Idx)) 
% Combine allL three results into an RGB image，Clip to [0，1] to 
% Compensate for floating-point arithmetic rounding effects， 

rgb = cat(3，R，G，B) 

rgb = max(min(rgb，1)，0) | 





卫 XAMPLE 0.2: 日 Figure 6.9 shows the hue，saturation,， and intensity coOmponents of an 
Converting from image of an RGB cube on a white background，similar to the image 
RGB tp 下 Fig, 6.2(b). Figure 6.9(a) is the hue image. Its most distinguishing feature 1 
the discontinuity in value along a 4$” line in the front (red) Plane of the 
cube. To understand thbe reason for this discontinuity, refer to Fig. 0.2(b)， 
draw a line from the red to the white vertices of the cube, and select a Point 
in the middle of this line. Starting at that point, draw a path to the right, fol- 
lowing the cube around until you return to the starting point, The major col]- 
ors encountered on this path are yellow, green, cyan,blue, magenta, and back 
to red.According to Fig. 6.7,the value of hue along this pathbh should increase 


《 尺 慌 
c=|ccjl=|C 
蕊 史 已 


This equation indicates that the components of ec are Simply the RGB compo- 
ents of a color image at a point, We take into account the fact that the coior 
omponents are a function of coordinates (xz, y) by Using the notatlon 


CR(x， yj 及 (xy) 
c(2 y) 三 | cc | =| CU y 
CB(L， 了 (2， 了 





or an image of Size M X N，there are MN Such vectors， cx y)，for 
二 和 ,2 二 

_ In some cases, equjivalent resujlts are_ obtained whether color Images are 
processed one plane at a time or as vector quantities. Howevel, as expjained in 





ab c 
FIGURE 6.9 HSI component images of an image of an RGB coior cube. (a) Hue, (b) saturation, and (c) 1 
intenslty 1mages. 
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FIGURE 6.10 
9%patial masks for 
gray-scale and 
RGB color 
Images. 


quations given here are straightforward extensions of the intensity transforma- 
on equation introduced in Section 3.2.As is true of the transformations in that 
action, all mn pseudo- or full-color transformation functions 7 
ndependent of the spatial image coordinates (xy)， 

2 Some of the gray-scale transformations introduced in Chapter 3，like 
mcomplement, which computes the negative of an image, are independent of 
he gray-level content of the image being transformed. Others, like histeq， 
hich depends on gray-level distribution, are adaptive, but the transformation 
ixed once the necessary parameters have been estimated. And still others, 
imadjust, which requires the User to select appropriate curve shape para- 
eters, are often best specified interactively A similar situation exists when 
orking with pseudo- and full-color mappings 一 particularly when human 
ewing and interpretation (e.g., for color balancing) are involved. In such ap- 
ications, the selection of appropriate mapping functions js best accomplished 
by directly manlpulating graphical representations of candidate functions and 
ewing thelr combined effect (in real time) on the images being processed. 
Figure 6.11 ilustrates a simple but powerful way to specify mapping func- 
tions graphically. Figure 6.11(a) shows a transformation that is formed by lin- 
arly interpolating three comatrol pointx (the circled coordinates in the 和 gure ); 
Fig. 6.11(b) shows the transformation that results from a cubic spline interpo- 
atlon of the Same three points; and Figs.6.11(c) and (d) provide more complex 
jnear and cubic spline interpolations, respectively Both types of Interpolation 
are Supported In MATLAB. Linear interpolation is implemented by Using 
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Gray-scale image RGB color image 


Z = interp1q(X，y，Xzi) 





Which returns a column vector contalining the values of the linearly interpolat- 
dT-D function z at points xi. Column vectors x and y Speclify the horizontal 
ad vertical coordinate pairs of the underlying control points. The elements of 
一 must Increase Ionotonically. The length of z is equal to the length of xi 
这 遇 Color Jransformations Thus, for example， 

The techniques described in this section are based on Processing the color 
components of a color image or intensity component of a monochrome Image 
within the context of a single color model. For color images, we restrict atten- 
tion to transformations of the form 


5 = =12...7 


where 记 and sw are the color components of the input and output Images, mils 
the dimension of (or number of color components in) the color space of 六 , and 
the Zare referred to as AlLcolor Pa13sjorpzation (or 11aappizag) functions， 

过 the input images are monochrome, then we write an equation of the form 


3i 二 《人 2 


where r denotes gray-level values， yand Zare as above,andmisthe number of 
color components in y .This equation describes the mapping of gray levelsinto 
arbitraty colors, a process frequentiy referred to as a pserudocolor ta113jo77114- 

1OH1 Or pserdocolor7maapping.Note that the first eqtation can be used to process 
monochrome images in RGB spaceifwelet 娓 三 户 三 力 = 广 JIneithercase'the 


>> z = interp1q([0 255],，[0 255],，[0: 255] ') 


produces a 250-element one-to-one mapping connecting control Points (0, 0) 
and (255, 255) 一 thatisz = [0 1 2 .。 . ，255]'， 


-0.5 0.5 


0.25 0.25 
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“FlIGURE 6.11 Specifying mapping functions using control points: (a) and (c) linear interpolation，and 
人 {b) and (d) cubic spline interpolation， 
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TIABLE 6.4 
Valid inputs for 
function ice， 
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ob:typlng hn at the prompt lists all the properties of the processed lmage (See 
Sections 2.10.6 and 11.11 ftor an explanation of Structures). To extract a Partic- 
llar PTOPerty, We typeh.PropertyName， 

Letting f denote an RGB or monochrome image, the following are exam- 
les of the syntax of function ice: 


In a similar manner, cubic spline interpolation 8 implemented using the 
spLine function， 


zZ = Spline(X，yY，Xi) 


where variables z, x, y, and xi are as described in the previous paragraph for 
interp1q.However,the xi must be distinct for use in function sp1ine. More- 
over, 让 y contains two more elements than X, lts first and last entries are as- 
sumed to be the end slopes of the cubic spline. The function depicted in 
Fig. 6.11(b),for example, was generated using zero-vValued end slopes. 

The specification of transformation functions can be made interactive by 
graphically manipulating the control points that are Input to functions 
interp1q and spline and displaying in real time the results of the transfor- 
mation functions on the images being Processed. The ice (interactive color 
editing) function does Precisely this. Hts SyntaX 1S 


ce 0nJy the ice graphical 
interface 1S diSpJLayed， 

icel( ' image ， 了 于 ) ; Shows and returns the mapped 

image g. 

Shows g and returns 

the hand]1e， 


Maps RGB image f in HSI Space， 


《c 
4 


icel inage ，f，' Wait ， off ) | 


(C2 
绩 
Do oo oocoe oe oo ce 


icel inage ，Tf， Space ， hsi ) 


YY 
> 
社 


g = ice('Property Name' ，'Property Value ，，. .|) 
utput. The output of ice js always RGBi; its Input ls always monochrome oOT 


RGB. Hwetypeg = ice( inage'，f),animage and graphical user Interface 
GUD like that shown in Fig. 6.12 appear on the MATILAB desktop. Initialjy, 


where 'Property Name' and 'Property Value must apPear in palrs, and 
the dots indicate repetitions of the pattern consisting of corresponding Inpnut 
pairs. Table 6.4 lists the valid pairs for use in function ice.>0me eXatmnples are 
given later in this section. 

With reference to the 'wait ' parameter, when the 'on' option ls Sejected 
either explicitly or by default, the output g is the processed image. In thjls case， 
ice takes control of the Process, including the cursor, So nothing can be typed 
on the command window until the function is closed, at which time the final 
result is image g. When 'off' is selected, g is the Partdljef of the processed 
image, and control is returned immediately to the command windowi; there- 
fore new commands can be typed with the ice function still active. To obtain 
the properties of an image with handle g we use the get function 


hn = get(g) 


This function returns all properties and applicable cuIrent values of the graph 
ics object identified by the handle g. The Properties are stored in structuIe h 
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Property Name Property Yalue 

' image， An RGB or monochrome input image, f,to be transformed by 
interactively specified mappings. 

Space The color space of the components to be modified, Possible 


values are 'Fgb'，'cmy'，'hsi'，hsv' ntsc' (or yiq'and 
'ycbcr '.The default is 'rgb . 
'Wait， H'on' (the default)，g is the mapped input image. 芝 "off，9 
is the handle of the mapped input Image. 


-一 
tfWhenever MATLAB creates a graphics object, it assigns an identifier (called a Fardie) to the object 
used to access the object's properties. Graphics handiles are Useful when modifying the apPearance 9 
graphs or creating custom plotting commands by writing M-files that create anq manipulate ObjeC”: 
directly. 


ee ， 
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TABLE 0.5 
Manlipulating 
control points 
with the mouse. 


了 上 入 AMIPLE 6.3: 
Inverse mapPplngs: 
monochrome 
negatives and 
color 
complements. 


TABLE 0.0 
Function of the 
checkboxes and 
pushbuttons in 
the ice GUI 








MIouse Actiont Result 












Lett Button Move control point by pressing and dragging. 
Left Button 二 Shift 天 ey Add control polnt TIhe location of the control point: 
can be changed by dragging (while stili pressing the: 
Shift Key). 由 后 入 下 0 


Lett Button + Control Key Delete control point. 



















人 


+ For three button mice, the left, middle and right buttons correspond to the move, add, and delete Opefl 
ations in the table. 


人 


the transformation curve is a straight line with a control point at each end 
Control points are manipulated with the mouse, as SUummarized in Table 65 
Table 6.6 lists the function of the other GUI components. The following exam 
ples show tyPpical appPlications of function ice， 


ndividual 人 ,and 如 maps are leftin their1:1 default states (see the Compo- 
ent entry in Table 6.6). For monochrome inputs, this guarantees monochrome 
utputs, Figure 6.13(b) shows the monochrome negative that results from the 
erse mapping. Note that it is identical to Fig. 3.3(b), which was obtained 
Sing the imcomplement function. The pseudocolor bar in Fig. 6.13(a) is the 
hotographic negative”of the original gray-Scajle bar in Fig. 0.12. 

nVeIse OF negative mapping functions also are useful in color processing. 
can be seen in Figs. 6.14(a) and (b),the result of the mapping is reminiscent 
fconventional color film negatives. For instance, the red stick of chalk in the 
ottom row of Fig. 6.14(a) is transformed to cyan in Fig. 6.14(b) 一 the color 


鄙 Figure 6.13(a) shows the ice interface after the default RGB curve 0 
RPRg. 6.12 1s modified to produce an inyverse or negative mapping function. 
cTeate the new mapping function,control point (0, 0) is moved (by clicking a 
dragging lt to the upper-left corner) to (0, 1), and control point (1,1) imove 
Slimllarly to coordinate (1, 0). Note how the coordinates of the cursor are dis 
played in red in the InputbOutput boxes. Only the RGB map is modified; 也 





GUI Element Functiotm 































Smooth Checked for cubic spline (Smooth curve) interpolation.I 喧 ， 
unchecked, piecewise linear interpolation is used. O11DLe71e71i of Ted. JTJhe complement of a primary color is the mixture of the 
Clamp Ends Checked to force the starting and ending curve slopes in cubic other two PITimaries (e.g., cyan is blue plus green). As in the gray-scale case， 
spline interpolation to 0. Piecewise linear interpolation is not jor complements are useful for enhancing detail that is embedded in dark 
affected. 8 了 xfeglons of color~-particularly when the regions are dominant in size.Note that 
Show PDF Display probability density function(s) [i.e., histogram(s)] of the eFr1i-color Bar in Rig. 0.13(a) contains the complements of the hues in the 
image components affected by the mapping function. rcoior Bar of Fig. 6.12， 
Show CDF Display cumujative distribution function(s) instead of PDFs. 
(Note: PDFs and CDFs cannot be displayed simultaneousjy ) 
Map Imapge 坟 checked, image mapping is enabled; otherwise it is not， 
Map Bars 于 checked,pseudo- and full-color bar mapping is enabled; 










otherwise the unmapped bars (a gray wedge and hue wedge， 
respectively) are displayed. 
Reset Inltiallize the currently displayed mapping function and uncheck- 
all curve parameters. 
Reset Al Jnitiallize all mapping functions， 
Jnput/Output Shows the coordinates of a selected control point on the 
transformation ctrve. IDPut refers to the horizontal axis, and 
Outpnut to the vertical axXils. 
Component sejlect a mapping function for interactive manipulation, In RGB 
Space, posslbie selections inciude R, G, B,and RGB (which maps 
all three color components), In HSI Space,the options are 酝 , $ 了 
and HSI, and so on. 一 
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ab 


FlIGCURE 6.13 

(a)j Anegative 
mapping function， 
and (bj its effect 
on the 
monochrome 
image of Fig. 6.12. 


有 efapir 人 Ge 了 
PCDPIRRS Gre 7OL 
SPomwm Er PTOSt 
exkaHtpie5. 


:和 必 : 


FIGURE 6.14 

(al) A full color 
image,and (bj its 
negative (color 
compiement). 
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EXAMPELEE 0.4: 
Monochrome and 
colior coOntTrast 
enhancement， 






a bfC 
det 


FIGURE 6.15 Using function ice for monochrome 
input images，both of which have a “washed-out”apPeatrancei; (b) and 
(c) and (f) are the ice displays(OTiginal monochrome image for this example courtesy of NASA.) 


潮 Consider next the use of function ice for monochrome and color contras 
manipulation. Figures 6.15(a) through (c) demonstrate the etfectiveness of 


ice in processing monochrome images. 上 lguUTIeS 6.15(d) through (有 sho 
similar effectiveness for color inputs. As in the previous exampie, mappin 
functions that are not shown remain in their default or 1 ;1 state. In both pro 


cessing Sequences, the Show PDF checkbox 18 enabled.Thus, the histogram of 


the aerial photo in (a) is displayed under the gamma-shaped mapping fun 
tion (see Section 3.2.1) in (c); and three histograms 3Ie provided in (f) fo 
the colorimage in (d) 一 one for each of its three color components， Althoug 
the S-shaped mapping function in ( 纪 IncreaseS the contrast of the Image i 


(d) [compare lt to (e)j],it also has aslight effect on hue. The smail change 叶 


color is virtually imperceptible in (ej, but is an Obvl0UsS Tesujt of the map 


ping, as can be seen in the mapped full-color reference bar in (全 . Recajll from 


the previous example that equal changes to the three components of a 
RGB image can have a dramatic effect on color (see the color compleme 


mapping in Fig. 6.14). 


-seeno-vewe1tver 钻 ---- 嚼 ~- 
1 草 


and full color contrast enhancement': (a) and (d) are 包 
(e) show the processed TeSHt 


0.4 恤 Color Transformations 


The red, green, and blue components of the input imasges in Examples 0.3 and 
4are mapped ldenticaly 一 that ls, using the same transformation function. To 
void the Specification of three identical functions, function ice provides an “alj 
omponents " function (the RGB curve when operating in the RGB color space) 

atis used to map all input components. IJhe remalining exampjles demonstrate 
ansformations in which the three components are processed ditferentjy 


As hoted earlier, when a monochrome image 1$ represented In the 人 OB 
olor space and the resulting components are mapped independently the 
ansformed result 1S a pseudocolor Image in which Input image gray levejs 
avye been replaced by arbitrary colors. Transformations that do thls are Useftul 

catuse the human eye can distinguish between millions of colors 一 but Tela- 
yely few Shades of gray, Thus, pseudocolor mappings are Used frequently to 
lake Small changes in gray level visible to the human eye Or to highlight Im- 
Drtant gray-sScale reglions. In fact, the Principal use of pseudocoloer is human 
Sualization 一 the interpretation of gray-scale events in an image Of Sequence 
fimages Vla gIay-to-color asSlgnments. 

Figure 6.16(aj is an X-ray image of a weld (the horizontal dark region) con- 
ining Several cracks and porosities (the bright white streaks running throughb 
e mijiddle of the image). A pseudocolor version of the image in shown in 





正 XAMPLE 0.9: 
Pseudocoler 
mappings. 


ab 

cx 各 

FIGURE 6.16 

(a) 共 -Tay of a 
defective weld; 
(b) a pseudo- 
coOlor veTSlon cf 
the weld; (c) and 
(d) mapPing 
functions for the 
green and blue 
cofmponents. 
(Original jimage 
COUrtesy of 准 - 
TEK Systems， 
Ltd .) 
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1ocess the Input in a variety of color spaces, as detailed in Table 6.4.To inter- 
ctively modify the CMY components of RGB image f1,for example, the ap- 
， Topriate Ice cajl ls 


Fig. 6.16(bj; it was generated by mapping the green and blue components 0 
the RGB-converted input using the mapPping functions in Figs. 6.16(c) and (d) 
Note the dramatic visual difference that the pseudocolor mapping makes.The 
GUI pseudocolor reference bar Provides a convenient Visual guide to the 
composite mapping. As can be seen iD Figs. 0.160(c) and (dj, the interactivejy 
specified mapping functions transform the black-to-white gray Scale to huet 
between blue and red, with yellow reserved for whlte. IJhe yellow, of course 
coOrTeSponds to weld cracks and porosities, which are the Important features 还 


this examPle. 


2 = icel( image ，fl1， ' space'，':CMY ' ) ; 


Fig. 6.17 shows, a Small decrease in magenta had a significant impact on 
ge color. 性 


量 ,Histogram equalization is a gray-level mapping process that seeks to pro- 
duce monochrome images with uniform intensity histograms. As discussed in 
ction 3.3.2, the required mapping function is the cumulative distribution 
nction (CDE) of the gray levels in the input image. Because color images 
muljtiple components, the gray-scajle technique must be modified to han- 
nore than one component and associated histogram. As might be expect- 
sd it is unwise to histogram equalize the components of a color imape 
Independentiy TIhe result usually is erroneous color.A more logical approach 
to Spread color intensities uniformly, leaving the coiors themselves (le., the 
es) Unchanged. 
Kigure 6.18(a) shows a color Image of a caster stand containing cruets and 
akers, The transformed image in Fig. 6.18(bj, which was produced using the 
JItransformations in Figs.6.18(c) and (dj,is significantly brighter. Several of 
,the moldings and the grain of the wood table on which the caster is resting are 
00W Visible. The intensity component was mapped using the function in 
148,0.18(c), which closely appProximates the CDEF of that component (also dis- 
yed in the figure).The hue mapping function in Fig. 6.18(d) was selected to 
ove the overajll color perception of the intensity-equajlized result. Note 
Mat the histograms of the input and outpnut Image's hue, saturation, and inten- 
omponents are shown in Figs. 6.18(e) and (f, respectively The hue com- 
onents are YIrtually identical (which is desirable)，while the intensity and 
aturatlon coImponents were altered. Finally note that, to Process an RGB 
age in the HSI color space, we included the Input Property name/value pair 
ace /hsi' inthecallto ice， 于 


EXAMDPLE 6.6: 盟 Figure 6.17 shows an application involving a full-color image, in Which it 闪 
Color balancing， advantageous to map an image's color components independently Common 
called color palancing Or coior correctiomn, thils type of mapping has been 
mainstay of high-end color reproduction Systems but noOw can be performed 
on most desktop compnuters. One lmportant use 1$ photo enhancement. Aj 
though color imbalances can be determined objectively by analyzing- 一 with a 
color Spectrometer 一 a known color In an image, accurate visual asSseSsIments 
are possible when whlte areas, wbere the RGB or CMY components Should 
equal, are present. As can be seen jn 瑟 g. 6.17, Skin tones also are excelle 
Samples for visual assessments because humans are hlghjly Perceptive of PrIo 
er Skin cojlor. 
Figure 6.17(a) shows a CMY scan of a mother and her child with an eXcesS: 
of magenta (keep in mind that only an RGB version of the image can be di 
played by MAILAB). For simplicity and compatibilty with MAILAB, fun 
tion ice accepts only RGB (and monochrome) Inputs as Well 一 but 
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赤 尝 一 一 


The output images generated in the Preceding examples in this section are of 
pe RGB and class uint8. For monochrome resujts, as in Example 6.3, all three 
mponents of the RGB output are jidentical. A_ more compact Tepresentation 
nbe obtained via the rgb2gray function of Table 6.3 or by using the command 
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3 bcG: 


FIGURE 6.17 Using function ice for color balancing: (a) an image heavy in magenta; (bj tbe correctec 
image'; and (c) the mapping function used to correct the imbalance， - 


here f2 ls an RGB image generated by ice and f3 is a standard MATILAB 
monochrome image. 
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下 上 和 AMPLE 6.7: 
下 istogram based 
mappings. 
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ab 

Cs 

e 于 

FIGURE 6.18 
Histogram 
equalization 
followed by 
Saturation 
adjustnmaent in the 
HSI color Space: 
(a) Input Image; 
(b) mapPped 
TeSult; 

(cj intensity 
component 
mapping function 
and cumujativVe 
distribution 
functioni 

(d) saturation 
comPonent 
mappPing functioni 
(e) input Image S 
comPponent 
histograms; and 
(人 maPped 
TeSult's 
compPonent 
histograms. 
































有 Spatial Filtering of Color Images 


市 e material in Section 6.4 deals with color transformations performed on Sin- 
image PIXejls of single color component planes. The next level of complexi- 
involves performing Spatial neighborhood Processing, also on single image 
lanes. TIhls breakdown is analogous to the discusslon on intensity transforma- 
ons in Section 3.2, and the discussion on Spatial fltering in Sections 3.4 and 
5 We introduce spatial filtering of color images by concentrating mostly on 

GB images, but the basic concepts are appjicable to other color models as 
].We illustrate spatial Processing of color images by two examples of lnear 
ering: image Smoothing and image Sharpening. 


.] Color Image Smoothing 


th reference to Fig. 6.10(a) and the discussion in Sections 3.4 and 3.5， 
oothing (Spatial averaging) of a monochrome image can be accormplished 
multiplying all Pixel values by the corresponding coefficients in the spatial 
Mnask (which are all 1s) and dividing by the total number of elements in the 
nask. The Process of Smoothing a full-color Image Using Spatial masKkSs 1jS 
19wWn in Fig. 6.10(b). Thbe Process (im RGB space for example) is formulated 
he same way as for gray-scale images, except that instead of single pixels we 
W deal with vector values in the form shown in Section 6.3. 

.et 9xy denote the set of coordinates defining a neighborhood centered at 
7y) in a color image. TIhe average of the RGB vectors in this neighborhood is 


和 0 2 让 





工 

G(7， y) = 也 人 >， c(3， ) 
(5 DJ)ESuy 

ere 玉 ls the number of Pixejls in the nelghborhood. It follows from the dis- 

nssion in Section 60.3 and the properties of vector addition that 
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Tecognize each cormmponent of this vector as the resujlt that we would obtain 
performing neighborhood averaging on each individual component Image， 
ing standard gray-scale neighborhood processing. Thus，we conclude that 
noothing by neighborhood averaging can be carried out on an independent 
mponent basis. The resujlts would be the same as 让 neighborhood averaging 
ere carTied out directly in color vector SPace. 

As discussed in Section 3.5.1, IPT linear spatial filters for image smoothing 
egenerated with function fspecial, with one of tbree options: "average '， 
disk' ,and 'gaussian' (see Table 3.4).Once afilter has been generated, fil- 
Tinp is performed by using function imfilter, introduced in section 3.4.1. 
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上 上 和 AMPLE 0.8: 
Color Image 
smoothinpg. 
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Conceptually smoothing an RGB color image, fc, with a linear spatial ft ab 
consists of the following steps: c d 
1 上 xtract the three component imapges: Re 
(b) through 
> (d) are the red， 
green and bjue 
2 Filter each component Image individually. For example,lettingw Tepreseiit component 
asSmoothing filter generated Using fspecial, we Smooth the red compags lma8es， 
resSpectlvely. 


nent Image as follows: 
>> fR_filtered = jimfilter(fR，W) ; 


and simjilarly for the other two component imapges. 


3。 有 econstruct the ftered RGB image: 
>> fc_fiitered = cat(3，TfR_filtered，TfG filtered，fB_ filtered)， 


However we can perform linear filtering of RGB images in MATLAB usin 
the Same Syntax employed for monochrome images, allowing us to combin 
the Preceding three steps into one: 


>> fc_filtered = imfilter(fc，wW) 


镭 Figure 6.19(a) shows an RGB image of size 1197 X 1197 PlIxXels an 
Figs. 0.19(b) through (d) are its RGB component images, extracted Using h 
procedure described iD the previous paragraph, Figures 6.20(a) through ( 
Show the three HSIcomponent images of Fig. 6.19(a)j, obtained using function 
rgb2hsi， 


ference between smoothing in RGB space and attempting to achieve a Simillar 
resujlt using only the intensity component of the image after it had been CODn- 
verted to the HSI color space，Figure 6.21(b) was obtained using the 
commands: 


>> h = rgb2hsi(fc)， 
人 

>> W = fsSpecial('average'，25) ; 

>> Tfiltered = imfilter(I，w， "replicate')， 

>> h = cat(3，H，S，I filtered) ， 

>> ff = hsi2rgb(h); 

>> =min(f，T); 5 RGB images must have values in the range [0，1]， 


>> ImSshowf{ 了 ) 
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a bc 


FIGURE 6.21 (a) Smoothed RGB image obtained by smoothing the 尺 ,C, and 号 image Planes separatej 优 
(b) Result of smoothing only the intensity component of the HSI equivalent j image.(c) Result of smoothbin 有 
ajl three 瑞 SI components equally. 


Clearly,the two filtered results are quite different. For examplie, in addition tg 
the image being less blurred, note the green border on the top Part of tbe 
flower in Fig.6.21(b).The reason for this is Simpjly that the hue and Saturation 
components were not changed while the variability of values of the intensi 媒 
components was reduced significantljy by the Smoothling ProceSsS. 人 A logical 
thing to try would be to smooth all three components Using the Same 人 ter， 
However, this would change the relative relationshijp between values of the 
hue and saturation, thus producing nonsensical colors, as Fig. 6.21(c) shows, 

In general, as the size of the mask decreases, the differences obtained whem 
filtering the RGB component images and the intensity component of the HSI 
equlivalent Image also decrease， 


Figure 6.22(a) shows a slightiy blurred version，fb， of the image in 
8 6.19(a), obtained using a $ X 3 averaging filter, To sharpen thls Image we 
ed the Laplacian flter mask 


人 = [111)1-81111]) 


en, as in Example 3.9, the enhanced image was computed and displjayed 
using the commands 


.fen = imsubtract(fb，imfilter(fb，1apmask， "replicate ) ) 
>> imsShow(fen) 
8.3.2 Color Image Sharpening 
Where we combined the two required steps into a single command. As in the 
previous section, RGB images were treated exactiy as monochrome images 
(Le., with the same calling Syntax) when using imfilter.Figure 6.22(b) shows 
the result. Note the signiicant increase in Sharpness of features Such as the 
Water droplets, the veins in the leaves, the yellow centers of the flowers, and 
the green vegetation in the foreground， 强 


Sharpening an RGB color Image with a linear Spatial filter follows the Same 
procedure outlined in the previous section, but using a Sharpening 和 ter 误 = 
stead. In this section we consider image sharpening using the Laplacian (see 
Section 3.5.1). From vector analysis, we know that the Laplacian of a Vector 允 
defined as a vector whose components are equajl to the Laplacian of the indiz 
vidual scalar components of the input vector. In the RGB color System， the 
Laplacian of vector cintroduced in Section 6.3 1S 


V<R(x， 
Vz[e(x, 让 ] = ww， 6 Working Directly in RGB Vector 9pace 
V2B(z， 7 As mentioned in Section 6.3, there are cases in whlch ProcessesS based on indi- 


Vidual color planes are not equivalent to working directiy in RGB vector 
Space， This is demonstrated in this section, where we llustrate Vector PTOCeSS- 
ing by considering two important applications in color image Processing: color 
edge detection and region segmentation. 


which, as in the previous section,tells us that we can compnute the Laplaclan 0 
a 名 1Lcolor image by computing the Lapjacian of each component 1mna 路 
Separately. 
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3a_b 


FIGURE 6.22 

(a) Blurred Image. 
(b) Image 
enhanced using 
the 志 Lapjlaclan， 
foljlowed by 
coOntrast 
enhancement 
Using function 
1Ce. 


EXAMODPLE 6.9: 
Color imapge 
sharpening. 
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colorgrad 
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下 XAMPELELEE 6.10: 
RGB edge 
detection Using 
function 
coLorgpnad. 






direction of maxlimum rate of change of c(z, y) as a function (x, y) is given 
by the angle 


1 8 
bx yy = 二 tan7| 一 一 一 一 一 
7 2 4 的 


and that the value of the rate of change (ie.,the magnlitude of the gradient)i 
the directions given by the elements of 9(x,，y) 1 given by 


































1 人 
友 (x, y) = 世 (区 20 二 250 20]} 


Note that g(xz, y) and Zoo(x, y) are images of the same size as the input image 
The elements of bg(x, y) are Simply the angles at each point that the gradient if 
calculated,and zFo(x, y) is the gradient image. 

Because tan(a) = tan(w 土 T), 这 bois asolution to the preceding tarm 
equation, SO is 00 士 T/2. 了 Furthermore, 而 (z 》) = 古 +r(x, yy)， So 已 needs to 
be computed onlyforvalues of g in the half-open interval [0, 六 ). The fact that 
the tan equation provides two values 90?* apart means that this equation 3S- 
Soclates with each point (xz, y) a pair of orthogonal directions. Along one of 
those directions ls mnaximum, and it is minimum along the other,so the final 
resujt ls generated by selecting the maximum at each point. The derivation of 
these Tesujts ls Tather ljengthy, and we would gain little in terms of the funda- 
mental objectlive of our current discussion by detailing it here.The interested 
reader Shoujd consujt the paper by Di Zenzo [1986] for details.The partial de- 
rivatives required for implementing the preceding equations can be compnut- 
ed using, for example, the Sobel operators discussed earlier in this section 

TIhe following function implements the color gradient for RGB images (see 
Appendix C for the codey): 


了 


[6 APFTG] = colorgrad(T，T) Letting f represent the RGB image in Fig. 6.24(d)j, the command 
where fls an RGB image,Tis an optional threshold in the range [0,1] (the de- 
faujt is 0); VG is the RGB vector gradient 玉 (xy); Ais the angle image 0(x, y)， 
in radians; and PPG is the gradient formed by summing the 2-D gradients of the 
Individual color Planes (generated for comparison Purposes),.These latter gra- 
dients are VRUz yy VCGUr y), and VB(xz, y)，where the V operator is as de- 
fned earlier in this section, All the derivatives required to implement the 
preceding equations are implemented in function coLlorgrad using Sobel oper- 
ators. The outputs VG and PPG are normalized to the range [0,1] by colorgrad 
and they are thresholded so that VG(x，y) = 0forvalues less than or equalto 
TandVvG(x，y) = VG(X，y) otherwise. Similar comments apply to PPG. 


[Tv6G6，A，PPG] = colorgrad(f); 


:produced the images VG and PPG shown in Figs. 6.24(e) and ( 提 . The most im- 
ortant difference between these two results js how much weaker the horizon- 
al edge in Fig. 6.24(f) is than the corresponding edge in Rig. 6.24(e), The 
eason is simple: The gradients of the Ted and green planes [Figs, 6.24(a) and 
b)] produce two vertical edges, while the gradient of the blue plane yields a 
Imgle horizontal edge. Adding these three gradients to form PPG Produces 3 
ertical edge with twice the intensity as the horizontal edge. 
On the other hand, when the gradient of the color image is computed directly 
了 Vector space [Fig.6.24(e)],the ratio of the values of the vertical and horizontal 
dges is V27 instead of 2.The reason again is Simple: With reference to the color 
eubein Fig.6.2(a) and the image in Fig.6.24(d) we see that the vertical edge in the 
oblor image is between a blue and white Square and a bjack and yellow square. 
e distance between these colors in the color cube is V2 .but the distance be- 
een black and blue and yellow and white (the horizontal edge) is only 1.Thus 
he ratio of the vertical to the horizontal differences is V5.I edge accuracy is an 





蝇 Pigures 0.24(a) through (c) show three simple monochrome images which， 
when used as RGB planes, Produced the color image in Fig. 6.24(d). The ob- 
jectives of this example are (1)toillustrate the use of function colorgrad,and 
(2) to show that compnuting the gradient of a color image by combining the 
gradients of its individual coior planes is quite different from computing the 
gradient directiy in RGB vector Space using the method just explained， 


6.6 盟 Working Directly in RGB Vector Space 
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FICURE 6-24 (a) through (c) RGB component images (black is 0 and white is 255). (d) Corresponding color 
ge.《e) Gradient comphlted directly 记 RGB vector space， (人 Composite _ gradient obtained by 
iputing the 2-D gradient of each RGB component image separatejy and adding the resujts. 
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FIGURE 6.25 

(a) RGB Image. 
(bj) Gradient 
computed in RGB 
Vector Space. 

(c) Gradient 
computed as ln 
Fig. 6.24( 们 ， 

(d) Absolute 
difference 
between (b) and 
(cj,scaled to the 
Tange |0, ]]. 
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issue, and especlally when a threshold is used, then the difference between thes: 
two appProaches can be significant. For example,if we had used athreshold of 0.6 
the horizontal jne im Fig. 6. 24 亿 would have disappeared. 

In Practice, when interest is mostly on edge detection with no regard for ac 二 
curacy, the two approaches just discussed generaliy yield comparable resul 
For example, Figs. 0.25(b) and (c) are analogous to Figs. 6.24(e) and (f).Th 
were obtained by applying tunction colorgrad to the Image in Fig, 6.25(a 
Figure 0.25(d) is the difterence of the two gradient images, scaled to the rang 
[0, j. IThe maximum absolute difference between the two images js 0.2, whi 
translates to 91 gray levels on the famijliar 8-bit range [0,253S]. Howevyver, these 
two gradient lmages are quite close in visual appearance，with Fig. 6.25( 
being sllightly brighter in some places (for reasons similar to those explainedir 
the Previous paragrapbh). TIhus, for this type of analysis, the simpler approa 
of computing the gradient of each individual component generally is accept: 
able. In other circumstances (as in the inspection of color differences in auta: 
mated machine lnspection of painted products), the more accurate vectof 
appioach may be necesSarVy. 


.6.2 Image 5egmentation in RGB Vector Space 


gmentation 1$ a process that partitions an image into Iegions.Although seg- 
ntation ls the topic of Chapter 10, we consider color region Segmentation 
efly here for the sake of continuity. The reader will have no difficulty fol- 
Wing the discussion， 

Color reglon Segmentation Using RGB color vectors is Straighttorward. Sup- 
ose that the objective is to Segment objects of a specified color range in an 
GB image. Given a set of sample color points representative of a color (or 
ange of colors) of interest, we obtain an estimate of the“average”or “Imean” 
olor that we wish to Segment. Let this average color be denoted by the RGB 
lumn vector m.The objective of segmentation is to classify each RGB Pixel in 
image as having a coior in the specified range or not. To perform this com- 
arison, we need a measure of Similarity. One of the simpjlest measures is the 
Euclidean distance. Letz denote an arbitrary point in RGB space.We say thatz 
Si7tiartoimnitthe distance between them is less than a specified threshold, 了 
he Euclidean distance betweenz and m is given by 


ZD(z,m) = 二 z 一 囊 | 


= [(z 一 m)7(z 一 四 )] 
[(zR 一 mapi +(zc 一 mc +(zs 一 


Where 用 is tbpe norm of the argument, and the subscripts 尺 , G, and 瑟 , denote 
e RGB components of vectors m and z. The locus of points such that 
( m) 三 了 isasolidsphere ofradius 7,asillustrated in Fig. 6.26(a). By def- 
nltion, Points contained within，oOr on the Surface of the Sphere satisfy the 
ecified color criterion; points outside the Sphere do not. Coding these two 
ets of points in the image with, say black and white， Produces a binary, Seg- 
fnented image， 
A useful generalization of the preceding equation is a distance measture of 
the form 


1/2 
1115)] 





m)7C-i(z 一 mm)] 7 


D(z, mm) 三 


[z - 
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Foliomwjirg coFyveEF- 
站 OH， We LSe 4G SUHDEr- 
SC1DE 三 IO Edicate 
YEctOr Or 174 厅 下 
frC1TSPDOSIEOR CPPG 2 
OFTIGL FLe 了 to 
derzofeG 雪 resHroida 
PEIe. Care SPorid 
PPe Exercised HoOt to 
CO1j15e 切 ese Mre- 
iated LU5es OF 芒 e 
Sa1Te Variapie， 


ab 

FIGURE 6.26 Two 
appPIToaches for 
enclosing data in 
RGB vector Space 
for the Purpeose of 
segmentation， 
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Jee ectiomr 72.2 jor 
adetaerled discrussyior 
OM eiciemtirapie- 
11eE11Glio1S 有 rcoj1- 
PHIEB8 IRe Exciidean 
270 Manaiarzopis 
Cstamcey 


coJLorseg 
5 


了 上 XAMPELE 6.11: 
RGB coilor image 
segmentation. 


ab: 

FiICURE 6.27 

(a) Pseudocoior 
of the surface of 
Jupiter's Moon Io. 
(b) Region of 
interest extracted 
inteTactively using 
function roipoly. 
(Original image 
coOurtesy of 
NASA.) 





































nask = Foipoly(f)， %5 Select region interactively， 
= immultipIy(mask， 了 f(:，:，1T)); 

een = immultiply(mask，f(:，:，2)); 

lue = immultiply(mask，f(:，:，3)); 

= cat(3，red，green，blue 
gure，imshow(g) 


where C is the covariance matrixf of the samples representative of the col 
we wish to segment. This distance is commonly referred to as the Mapa1an 
distance. The locus of points such that D(z, m) 三 T describes asolid3-D el 
tcal body [see Fig. 6.26(b)] with the important property that its Principal a 
are oriented in the direction of maximunm data spread. When C = Itheids 
tity matrix, the Mahalanobis distance reduces to the Euclidean distance, Se 
mentation is as descTibed in the preceding paragraph, except that the data 
DOW enclosed by an ellipsoid instead of a sphere. 

egmentation in the manner just described is jimplemented by functiof 
Colorseg (see Appendix C for the code), which has the syntax 


ere mask is a binary image (the same size as f) with 0s in the background 
iis in the region selected Interactivejiy. 
Next, we compnute the mean Vector and covariance matrix of the points in 
he ROL but first the coordinates of the points in the ROI must be extracted. 


[M，N，K] = size(g); 

工 = Preshape(g， M* N，3); % reshape is discussed in Sec， 8.2.2. 
idx = find(mask)， 

= double(I(idx，1:3) ); 

[C，mj = Covmatrix(I); % See Sec， 11,.5 for details on covmatPix . 


S = CO0lorseg(method，f，T，parameters) 


where method is either 'euclidean' or 'mahaLanobis',fis the RGB Imag 
to be segmented,andTis the threshold described above,The input Parameter 
are eltherm 过 :euclidean' is chosen,ormandC 让 ,mahalanobis' is chosenm 
Parameter m is the vector, mn described above, in either a row or column for 
mat, and C is the 3 X 3 covariance matrix，C, The output, S, is a two-leve 
image (of the same size as the original) containing 0s in the points failing 也 
threshold test, and 1s in the locations that Passed the test. The ls indicate 
regions Segmented from f based on color content， 


iese are the non-background pixels of the masked image in Fig. 6.27(b). 
:The fnal preliminary computation is to determine a value for 人 A good 
。 tarting point is to letTbe a mnultiple of the standard deviation of one of the 
是 Figure 6.27(a) shows a pseudocolor Image of a region on the surface of ti 
Jupliter Moon Io. In this image, the reddish colors depict materials newly eject 
ed from an active volcano, and the SUIIOUnding yellow materials are older sul .Square Ioots: 
fur deposits. This example ilustrates segmentation of the reddish Tegion Using 
both options in function colorseg. 
First we obtain samples representing the range of colors to be segmented 
One Simple way to obtain such a region of interest (ROD is to use function 
roipoly described in Section 5.2.4, which produces a binary mask of a region 
sejected interactively. Thus, letting f denote the color image in Fig.6.27(a),the 
region in 矶 g. 6.27(b) was obtained using the commands 


人 


dz= diag(C)) 
:Sd = Sqrt(d): 


22.0643 ”24.2442 “16.1806 


he first element of sd is the standard deviation of the red component of the 
olor Pixels in the ROL and similarly for the other two components. 
We now proceed to segment the image using values of 六 equal to multiples 
f25, which is an approximation to the largest standard deviation: 厂 = 2S, S0， 
100. For the 'euclidean， option with 了 = 2$, we Use 


> FE25 = Colorseg('euclLlidean'，f，25， 站) ; 


gure 6.28(a)j Shows the result, and Figs. 6.28(b) through (d) show the seg- 
entation results with 7 = 50, 75, 100. Similarly Figs. 0.29(a) through (d) 
WwW the results obtained using the 'mahalanobis， option with the Same se- 
lence of threshold values， 
Meaningful results [depending on what we consider as red in Fig. 6.27(a)] 
Were obtained with the 'euclidean' option when 人 = 25 and 50，bnut 
= /5 and 100 produced significant oversegmentation. On the other hand， 
the results with the 'mahalanobis' option make a more sensible transition 


二 


Computation of the covariance matrix of a set of vector Samples is discussed in Section 11.5. 
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d = diag(C) 
relrt3 i Vectord 
thje aain diagsonal of 
1TQtrEX C. 
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a hb rincreasing values of 7 The reason is that the 3-D color data spread in the 
c d OTis fitted much better in this case with an ellipsoid than with a sphere. 
FIGURE 6.28 Jate that in both methods increasing 了 allowed weaker shades of red to be 
(a) through 本 cluded in the segmented regions, as expected. 外 


(dj Segmentation 
of Fig. 6.27(a) 
using Option 
'euclLidean' in 


with 在 Eolor in image Processing, and on the implementation of these concepts using 
了 二 2530 73， ATLAB,IPI,and the new functions developed in the preceding sections,. The area of 
and 100， lor models is broad enotugh so that entire books have been written on just this topic， 
TeSPectively. emodels discussed here were selected for their usefuiness in image processing, and 
人 So because they provide a good foundation for further study in this area. 
The material on pseudocolor and fujl-color Processing on individual color planes 
ides a tie to the image Processing techniques developed in the Previous chapters 
monochbrome Images. IJhe materTial on color vector Space is a departure from the 
thods discussed in those chapters, and highjights some important dijfferences be- 
en gray-scale and ful-color image processing. The techniques for color-vector pro- 
essing discussed jn the pTevlous Section are TepIesentative of Vector-based processes 
hat inciude median and other order flters adaptive and morphological filters, image 
estoration, image coOmpression, and many others. 
a b 
c d 
FIGURE 6.29 
(alj through 


(d) Segmentation 
of Fig.0.27(al) 
using Option 
manhalanobis， 
In function 
Colorseg with 
了 = 235, 30, 79， 
and 100， 
TeSpPectlvVely. 
Compare with 
Fig. 0.28. 
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7 y) >， 了 (u， We 攻 人 y) 


t、 已 .、. . 


ps .and Ainthese equations are called /fomward and inverse Mra1a3- 
alti01 Kernels， reSpectively. They determine the nature，computational 
omplexity and ultimate usefulness of the trransjoprnrz pair Transjornml coe 护 - 
1 (1 2) can be viewed as the expa1siom coejficients of a series ex- 
ision of 三 with respect to {P That is, the inverse transformation 
iiel defines a set of expazsiom jpctions for the series expansion of 矿 
he discrete Fourler transform (DEFET) of Chapter 4 fits this series expan- 
fformujation wei， In thls case 





太 二 7 一 外 7 了 Pr(eryM+awN) 

VAN 
6 = V-l,*lsthe complex conjugate operator 4 三 0,1 .AM -1 
=01 AN 一 1 Iransformn domalin varlables vv and it represent hori- 
1and vertical frequency, respectively. The kernels are yeparapble Since 


jet J) 二 人 ()o(7) 






























discrele wavelet transfpr (DWT) is the mathematical tool of cholice. In ad A(x) = ef2mtvM and P(y) = ejPmDWVN 
tion to being an efficient, highly intuitive framework for the representatiom N 
and storage of 11XU1resoiutiom images,the DWT provyides Powerful Insight int: 0 
0 snd ortomOr7aal Since 
an image's Spatial and frequency characteristics, IJhe Fourier transform, on th 
other hand, reveals only an image's frequency attributes. 人 1] = 
天 到 站 二 


in this chapter, we explore both the computation and use of the discret 0 otherwise 


wavelet transform，We introduce the Jovelel 7oolpox，a collection 0 
MathWorks'” functions designed for wavejlet analysis but not included 1 
MATLAB'S Zaage Proceysing TDpolpox (IPT),and develop a compatible set 0 
routines that allow basic wavejlet-based processing using IPT alone; that 
without the WaveletToolbox.These custom functions,in combination with IP3 
provide the tools needed to implement all the concepts discussed in Chapter 
of Digital Jrage Processing by Gonzalez and Woods [2002].They are appliedan 
much the same way 一 and Provide a similar range of capabijlities 一 as IPT func 
tions fft2 and ifft2 In Chapter 4. 


Te ) is the inner product operator.The separability of the kernels simpli- 
the computation of the 2-D transform by allowing row-column or column- 


and Inverse kernels to be the complex conjugates of one another (they would 
ldenticai 坟 the functions were real)， 
nlike the discrete Fourier transform, which can be compietely defined by 
ostralghttorward equations that revolve around a single pair of transforma- 
kernejls (given previously), the term diycrele mwayveler transforma refers to a 
as s of transformations that differ not only in the transformation kernels em- 
小 和 加 Background 0yed (and thus the expansion functions usedj, but also in the fundamental 
Consider an image 放 (xz, y) of Size M X N whose forward, discrete transform RN 人 ee 1 人 0 0 
7 (Lu UV ), can be exXpressed in terms of the general relation ie asis) 人 人 0 
0 sojutions are computed). Since the DWT encompasses a variety of 
mlque but related transformations,， we cannot write a single equation that 





(5 
怀 ，》 








where xY and y are Spatial variables and x, 2,..，are 1rasjora do7a0i0 YI 人 < 
apjes.Given 7 (zt ,jx y) canbe obtainedusing the generalized InV6IS di 
discrete transform 


FI formuiation of Chapter 4.a 1/AMN term is placed in the inverse transform equation. Equiv- 
Ht can be incorporated into the forward transform only. or split. as we do here,. between the 
TWward and inverse transtormations as 1 VAN. 
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,is Orthogonal to lts lnteger translates. 

The set of functions that can be represented as a series eXpansion of pj eat 
bwscales or resolutions (ie.,smajl j) is contained within those that can be 
epresented at hligher scales. 

The only function that can be represented at every Scale js Fxz) = 0. 

Any function can be represented with arbitrary Precision as /一 ” co. 


ab: 


FIGURE 7.1 本 
(a) The familiar 


Fourier expansion 

functions are 

Sinusolds of 

vaTyinpg frequency 

and infinite 

duration. 

(by) DWT RS 
expansion 8 
functions are 

“small waves”of 

finite duration 

and varying 二 


frequency. 





ithese conditions are met, there is a companion wavelet 几 that together 
ts integer translates and binary scalings, spans 一 that is, can represent 一 the 
jifference between any two sets of pj)k -Tepresentable functions at adjacent scales、. 


perty 3: Ortpogomality. Ihe expansion functions [ie.{pj cz) form an 
normal of biorthogonal basis for the set of 1-D measurable，square- 
able functions, IJo be called a basis, there must be a unique Set of expan- 
coefficients for eveIry Tepresentable function. As was noted in the 
oductory ITemarks on EourIier kernels, gw = 有 for real orthonor- 


completely describes them all, Instead, we characterize each DWT byatr ernejls 上 or the blorthogonal case， 


form kernel pair or set of parameters that defines the pair. The var 
transforms are Telated by the fact that their expansion functions are “S 
Waves” (hence the name Wavelet) of varying frequency and limited durati 
[see Fig. 7.1(b)]. In the remainder of the chapter we introduce a numbe 
these“small wave”Kkernels. Each possesses the following general properti 


1 了 = 了 


Wo 8 0 0 otherwise 


is called the dual of 六 .For a bliorthogonal wavejlet transform with scalling 


), Tespectively. 
Eroperty 工 Separapiitp caiabiimy and Transiatapility. The kernels can 
Tepresented as three sepafable 2-D wayele 必 


JW (zy) = 几 (X)p(y) 
风 (ty) = ep(z)U(y) 
Wi (xy) = 区 (z) 沙 (7 


Where We (zand Volz are called porizomtal vertical 8 
di48O14L Wavelefy,Tespectively and one separable 2-D scalizzg Fus1action 


)e eXPpIessed as jnear combinations of double-resolution coples of them- 
Selves. That is, Via the series expansions 


二 
人 
| 


一 六 有 (站 V2p(2x 一 万) 


Pt， y) 一 pz)Pp(y) 有 jn 人 (mV2p(2x 志 17) 
上 ach of these 2-D functions is the product of two 1-D real, square-integra 7 


scaling and wavelet functions 
ojk(Z) = 21p(20x 一 大 ) 
WAk(XZ) = 212(207 一 大) 
77273iatiom K determines the position of these 1-D functions along the X-aXIS 
Scale ) determines their width 一 how broad or narrow they are along xz 一 an 
2j2 controls their helght or amplitude. Note that the associated expansl0 


functions are binary scalings and integer translates of 7aotper Wavel 
光 x) = yoo(x) and scaling function p(x) = po o(X)， 


嫩 
“一 
忆 
履 -一 
| 


ere 1 and 1 一 the expansion coefficients 一 are called scajizg and wavetiet 
cto1s, Tespectively. They are the filter coefficients of the jost wayveler 1ra1zs- 
1 (下 WT)，an iterative computational approach to the DWT shown 训 


caling and wavelet vectors 一 the Po( 一 有 ) and Pr 一 玉 ) 一 are 1OwPpaSy and 
hpass decomapositiorm Fifers,respectively. Finally blocks containing a2 and a 
arrow Tepresent dowm154111D1i8 一 extracting every other point from a Se- 
ce of points. Mathermatically, the series of filtering and downsampling 


， eratio ed to compute 全 4 7 2) ip Fig.7.2 is for example， 
Property 关 MUresouton Comapatipib Tihe 1l-D scaling function just intr 2 P y (7 ) 


duced satisfies the following requirements of multiresolution analysis: (7 rm， PP) 三 用 ( 一 ) [用 (一 个) 全 (上 + 1 7 7 -akzo]| =2k kz0 
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0 0 Wavelet vtamily 四 ， wname We 7.1 
Rows avejlet lbox 
和 er bank Each 人 网 Haar haar， haar FWT frteryang 
pass generates one CR 厂 ， oz Daubechles db2 ， db3 ,…， db45， filter family 
DWT scale. In the Rows Coiflets coif， coif1','coif2':,..。'coif5， aiag 
first iteration， WU 十 二 mm 让) Symlets SyYm 'Sym2' ，Sym3',...，:SyYm45， 
全 ,人 (+ 1 PP) = Discrete Meyer dmey ， dmey， 
We 六 /iv( 一 7 yy (让 oa Biorthogonal :bionr， 'bior1.1'，'bior1.3':，'bior1.5'，'bior2.2 
Rows 'bior2.4'，'bior2.6'，'bior2.8'，'bior3.1 
En bior3.3'，bior3.5'，'bior3.7'，'bior3.9'， 
刀 ( 一 四 ) By ( 信 硼 'bior4.4，'bior5.5'，'bior6.8， 
R _Reverse 'rbio， 'rbio1,.t'，rbio1.3:，'rbio1.5'，'rbio2.21， 
| Biorthogonal 'rbio2.4'，rbio2.6'，'rbio2.8'，'rbio3.1'， 
where * denotes convolution. Evaluating convolutions at nonnegative， eve bio3.3 ，rbio3.5 ，rbio3.7 ，rbio3.9'， 
indices is edquivalent to filtering and downsampling by 2. bio4.4 ，rbio5,5' ，rbio6.8 
Each Pass through the filter bank in Fig. 7.2 decomposes the Input imto fou 
lower resolution (Or lower Scale) components. jhe W。coefficients are create 
Via two lowpass (jl.e., ho-based) filters and are thus cajled apProxiraatiom coe 本 
jicients; {Wyfori = 克 ,7,D} are Porizonatal vertical and diagomal detail co With type set to d ，P ，1 or h toobtain apair of decomposition, re- 
eicients,Tespectively Since 让 x, y) ls the highest resolution representation 0 construction, JowPpass， OF or iebpass filters Tespectively 开 this syntax js em- 
the image being transformed, it serves as the 卫 ,(j + Limom input for 也 ployed， a decomposition or ljowpass filter is returned in F1,and its companion 
first iteration.Note that the operations in Fig.7.2 use Heiiper wavelets nor scal placed In F2. 
ing functions 一 only their associated wavelet and scaling vectors. In addition，， Table /TDsts the 下 WT fters included in the Wavelet Toolbox, Their 
three transform domain variabjles are involved 一 scale ) and horizontal an Properties 一 and other useful information on the associated scaling and 
vertical translation,m and zaz.These variables correspond to 忆 2 . .in the firs Wavejlet functions 一 is available 训 the literature on digital fltering and mul- 
two equations of Section 7.1. tiresolution analysis. Some of the more important properties are provided by 
the Wavelet Toolbox's waveinfo and wavefun functions. To Print a written 
FWTs Using the Wavelet Toolbox description of wavelet family wfamily (see Table 7.1) on MATLAB'S 
In thls section, we use MATLAB'S Wavelet IJoolbox to compute the FWT of Command Window for example,enter 
Simple 4 X 4test Image. In the next Section, we will develop custom function 
to do this without the Wavejet Toolbox (ie., with IPT alone).The material here wavelnfo(twfanmily ) 等 角 waveinfo 
[lays the groundwork for thelr development， 二 
The Wavejlet Joolbox provldes decomposition filters for a wide variety 0 atthe MATLAB prompt, Io obtain a digital approximation of an orthonormal 
fast wavelet transforms. The filters associated with a specific transform are ac fansform's scaling and/or warvelet functions, type 
cessed via the function wfilters, which has the following general syntaxX: 
[phi，psi，xval]j = Wavefun(wname，iter) 
% WwWfilters [Lo D，Hi D，Lo _R，Hi R] = wfilters(wname) 
which returns approximation vectors,， phi and psi, and evaluation vector 
7 Here, Input parameter wname determines the returned fllter coefficients in ac .%XVail. 了 ositive integer iter determines the accuracy of the apPIOXimations by 
2 cordance with Table 7.1: outputs LoO_D, Hi_D,Lo_R,and Hi _R ate row vectors ontrolhang the number of iterations used in their comptuitation. For biorthogo- 
2 人 that return the lowpass decomposition, highpass decomposition, lowpass re- al transforms, the appropriate syntax is 
OpDPosedl fo construction, and highpass reconstruction filters, respectively. (Reconstruction 
M47TL4Bor jnage filters are discussed in Section 7.4， ) Frequently coupled filter pairs can alter- [phil，psi1，phi2，psi2，xvall = wavefun(wname，iter) 和 


Procesysig 7Dolbor 


六 oa nately be retrleved Using 


here phi1l and psi1 are decomposition functions and phi2 and psi2 are 
[F1，F2]j = wWfilters(wname，+type ) 洒 。reconstruction functions 
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Haar wavejlet function FIGCURE 7.3 The 
1.5 Haar scaling and 
wavejlet functions， 





Haar scajing function 

























EXAMPLE 7.1: 昌 The oldest and Simplest wavelet transform is based on the Haar scajling 二 
aar flters， wavelet functions. The decomposition and reconstruction fters for a 于 aa 


Scaling, and 
人 based transform are of length 2 and can be obtained as follows: 


>> [Lo_D，Hi D，Lo_R，Hi R] = wfilters( 'haar') 


Lo D = 0.5 
0.7071 0.707] 
Hi D = 0 
--0.7071 0.7071 
Lo _R = -0.5 
0.7071 0.707]1 
Hi _R = -1 
0.7071 -0.7071 
Their Key Properties (as reported by the waveinfo function) and Plots of t 0 0.5 
asSociated scaling and wavelet functions can be obtained USIng 
>> Waveinfol( haar' ) ; Subplot(122); plot(Xxval，pSl， KK' ，Xval，XaxliSs， --KkK )，; 


axis([0 1 -1.5 1.5]); axiS Square 


HAARINFO Information on Haar wavelet， tit1lef('Haar Wavelet Function') 


Haar Wavelet 





ipgure 7.3 shows the display generated by the final six commands. Functions 
计 le, axis, and p1Lot were described in Chapters 2 and 3; function Subp1lLot 
used to subdivide the figure window into an array of axes oOT SUbplots. It has 
efollowing generlic SyntaX: 


General characteristics: Compact1y supported 
WaveJlet，the 0ldest and the Simplest Wavelet， 


scaling function phi 


1on[o1 and 0 otherwise， 





wavelet function psi= 1 on [00.5]，= -on [0.5 1] and 4 
otherwise， 本 
Family 全 H = SubplLlot(m，n，p) or H = SubpLot(mnp) lot 
时 和 和 herem and n are the number of rows and colurmns in the Subplot afTay, Te- 
Orthogonal yes pectively. Both m and n must be greater than |， OPptional output variable H 18 
Biorthogonal yes handle of the subplot (ie., axes) selected by p, with incremental values of p 

Compact Support yes eginning at 1) selecting axes along the top row of the figure window,then the 

DWT possib1le cond row, and so on. With or without H, the pth axes is made the current plot. 

CWT possible ius, the subplot(122) function in the commands given PITeviously Selects 

Support width plot in row 1 and column 2 of a 工 X2Subplot arTay as the current Plot; the 

Filters length Subsequent axis and tit1le functions tben appPly onjy to 让 t. 

RegulLarity haar is not_ continuous The Haar scaling and wavejlet functions shown in Figure 7.3 are discontinu- 
Symmetry YeS ous and comapactty SUPPorted, which means they are 0 outslide a finlte InteTval 

Number of vanishing called the suppo 上 t Note that the support is 1. In addition, the waveinfo data 

momentSs for pSiIi 1 ieyeals that the Haar expansion functions are orthogonal, so that the forward 


可 1 呈 中 师 二 激 
Reference: TI. Daubechies， 1 Inverse transformation keInels are identical， 


Jen Jectures oOn Wavelets， 
CBMS，SIAM，61，1994，194-202 . 


>> [phi，psi，xval] = wavefun( 'haar'，10) ; 

>> XaxliS = zeroSs(Size(XvVal) ); 

>> Subplot(121);j plot(xval，phi，'k'，Xxval， XaXxiS， ' --K ); 
>> axis([0 1 -1.5 1.5]); axis square， 

>> 二 itJIe( Haar Scaling Function' ) ; 


Given a set of decomposition filters, whether user provided or generated by 
the wfilters function, the simplest way of computing the associated wavelet 
transform is through the Wavejlet Toolbox's wavedec2 function. It is invoked 





[C，S] = wavedec2(X，N，Lo_D，Hi D) 
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s0 0n0. lt we Were to extract the horizontal detail coefficient matrix from 


where X1 a2-Dimage or matrix, N is the number of scales to be comptu 
Tct,for example, we would get 


(Le.,the number of passes through the FWT filter bank in Fig.7.2),and 1L4 
and Hi_D are decomposition filters. The slightly more efficient syntax 
1 = 
[C，S] = wavedec2(X，N，wname) 


in which wname assumes a value from Jable 7.1, can also be used. Output d 
Structure [C，S] is composed of row vector C (class double), which contaii 
the computed wavelet transform coefficients, and bookkeeping matrix S (alsl 
class doub1le), which defines the arrangement of the coefficients in C.There 
tionship between C and S is introduced in the next example and describe 
detail in Section 7.3. 


gef [in vector s1(end，: )j. Vectors st1(1，:) and s1(2，; ) contain the 
ss of the computed approximation matrix and three detail coefficient matri- 
Tespectively. The first element of each vector is the number of rows in the 


汉 When the Single-scale transform described above is extended to two scales， 
了 上 XAMPLE 7.2: 油 Consider the following sin8gle-scale wayvelet transform with respect to Ha Pet 
Asimple FWT wavelets: 


using Haar filters. Tc2， sS2] = Wavedec2(f，2， 'haar ') 


>> ff = magic(4) 
Columns 1 through 9 





f = 
16 2 3 13 34.0000 0 0 0.0000 1.0000 
se 11 10 8 -1 .0000 一 1 .0000 1.0000 4.0000 
3 2 6 1< CoJlumns 10 through 16 
公 -4.0000 -4.0000 4.0000 ”10.0000 6.0000 
>> [cl，S1] = wavedec2(f，1，'haar') -6.0000 -10.0000 
ci = 
ColLumns + through 9 1 
17.0000 17 ,0000 17 ,0000 17.0000 ] : 
一 1 .0000 一 [1.0000 1.0000 4,.0000 2 2 
Columns 10 through 16 4 4 
一 4,0000 -4.0000 4.0000 10 ,0000 
-6.0000 -10.0000 te that c2(5:16) = c1(5:16).Elements c1(1:4) ,which were the apPIOXIi- 
mation coefficients of the single-scale transform, have been fed into the filter 
> ank of Fig, 7.2 to produce four 1 X 1 outputs: 砚 。(0,0,.0)， 且 0,0,0)， 
2 2 了 4 (0， 0,0)，and 殉 4(0， 0, 0). IJhese outputs are concatenated columnwise 
4 4 though they are 1 x 1 matrices here) in the same order that was used in the 


Teceding single-scale transform and substituted for the approximation coeftfi- 


导 


ents fom Which they were derived. Bookkeeping matrix s2 is then updated 
0 reflect the fact that the single 2 x 2 approximation matrix in c1 has been 
eplaced by four 1 xx 1 detail and approximation matrices in C2. Thus， 
sc(end ， : ) 1S once again the Size of the original Image,S2(3，;: ) is the size of 
he three detail coefficient matrices at Scale 1,S2(2，:) isthe size ofthe three 
etail coefficient matrices at scale 0 and s2(1，:) ls the Size of the final 


pproxlimation ， 涨 


Here,a4X4magicsdquare fis transformed into a 1 X 16 wavelet decompo- 
sltion vector c1 and 3 X 2 bookkeeping matrix s1.The entire transformation 
Is performed with a single execution (with f used as the input) of the OperTa-: 
tions depicted in Fig.7.2. Four 2 X 2 outputs 一 a downsampled approximation 
and three directional (horizontal, vertical and diagonal) detail matrices 一 ar6 
generated. 上 unction wavedec2 concatenates these 2 X 2 matrices columnwise 
in TOw vector c1 beginning with the approximation coefficients and continuin 
with the horizontal，vertical，and diagonal details. That is ，c1(1) throug 
c1(4) are appIOXImation coefficients 友 o(1, 0.0), 玖 (10), 环 。(1,0,1),an 1 
He 1 1) from Fig.7.2 with the scale of f assumed arbitrarily to be 2; c1(5) 
through c1(8) are 厂 y (0.0)， WU 0)， mb 0 and 克 信 1 和 1 证 


_Io conclude this section, we note that because the FWT is based on digltal 
itering techniques and thus convolution,border distortions can arise.To min- 
lze these distortions, the border must be treated differently from the other 
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臣 = (0:7); 
hd = 1d; hd(end:-1:1) = 056S (1 全 
LIr= lgj 1lLr(end:-1:1) = Jd ; 
hr = Cos(pi * t) .* 1d， 
Case  SYJ4， 


1d = [-7.576571478927333e-002 -2.963552764599851e-002 .,， 

4.976186676320155e-001 8.037387518059161e-001 ，.， 
<“.9785779560527748e-001 -9,.9219543576847226-002 ... 
于 | .260396726203783e-002 3.222310060404270e-002 ] ; 


t = (0:7) 
hd = 1d; hd(lend:-1:1) = coSs(tpl * t) .* 1d; 
LIr= jldij 1lr(end:-f:1) = d ; 


hr = Cos(pi * t) 


人 


CasSe “bior6.8: 


1d = [0 1.908831736481291e-003 -1.914286129088767e-003 ... 


hd = [0 0 0 1,442628250562444e-002 -1.446750489679015e-002 ,.， 
-7”.872200106262882e~002 4.036797903033992e-002 ，,， 
4.178491091502746e-~001 -7.58907729453654286-001 ，,， 


1LPF = 
hr = 


~1.699063986760234e-002 1.1934565279729266-002 


4.973290349094079e-002 -7.72631731672041486-002 
-9.405920349573646e-002 4.207962846098268e-001 .，，， 


8 ,<59229974584023e-001 4.207962846098268e-001 .. 


-9.405920349573646e-002 -7.726317316720414e~002 村 


4.973290349094079e-002 1.193456527972926e-002 .，， 


-1.6990639867602346-002 --1.914286129088767e-003 ，,， 


1.908831736481291e-003] ; 


4.178491091502746e-001 4,036797903033992e-002 ，,， 


-7 ,872200106262882e-002 -1,446750489679015e~002 .，.、 


1.442628250562444e-002 0 0 0 0]; 
t= (0:17); 


CS1p1 yx (十 人 
cos(pi * 七 ) 


wd， 
人 


CasSe “ jpeg9.7， 


Jd = [0 0.02674875741080976 -0,01686411844287495 


-0.07822326652898785 0.2668641184428723 0 


0.6029490182363579 0,.2668641184428723 ，,. 


-0.07822326652898785 -0.01686411844287495 ,,， 


0.02674875741080976] ; 

dj = 
0.5912717631142470 ~1 ,115087052456994 .，,， 
0,.5912717631142470 0.05754352622849957 
-0.09127176311424948 0 0] ; 

tf = (0:9); 

] 六 = cos(pi * (t+T)) 


,.* hd; 
hr = Cos(pi *t) ,* 10， 


otherwise 


end 


errort Unrecognizab1e Wavelet name (WNAME) .' ) ; 


[0 -0.09127176311424948 0.05754352622849957 ... 





















ch 1owerttypef1) ) 


Varargout = {1d，hd} 
1P 
rargout = {Ir，hPr]) 


erwise 
( "Unrecognizabjle filter TYPE. ) 


放 
2 


flters and the highpass decomposition fjter ls a modulated verslion of lts 
wpass Counterpart. Only the lowpass decomposition filter coefficients need to 


vector elements from jast to first with statements like Lr(end: -1:1) =1d. 
odulation ls accomplshed by multiplyimg the components of aknown fjlter by 
Pix*t), which ajternates between 1 and 一 1 as t increases from 0 in IntegerT 
PS For each biorthogonal filter in wavefijilter (ie.， bior6.8' and 
peg9.7'), both the lowpass and highpass decomposition filters are specified; 
Teconstruction fjlters are computed as modulations of them. Finally we note 
the flters generated by wavefilter are of even length. MoreoveT,， ZeTo 
ing ls used to ensure that the lengths of the decomPposition and reconstruc- 
Hijters of each wavelet are identical. 
Given a pair of wavefilter generated decomposition filters, it is easy to 
a general-purpose Toutine for the compautation of the related fast 
let transform. Ihe goal is to devise an efficient aigorithm based on the 起 ]- 
ng and downsampling operations in Fig. 7.2.Io maintain compatibiity with 
he existing Wavelet Toolbox, we employ the same decomposition structure 
je, [CS] where Cis a decomposition vector and S is a bookkeeping matrix)， 
he following routine, which we call wavefast, uses Symmetric image exten- 
ion to reduce the border distortion associated with the computed FWT: 





unction [C，S] = wavefast(xX，n，varargin) 
WAVEFAST Perform multi-1evel 2-dimensional fast Wavelet transform， 
[CE，L] = WAVEFAST(X，N，LP，HP) performs a 2D N-1evel FWT of 


image (or matrix) X With Fespect to decomposition filters LP and 
HP ， 


[C，L] = WAVEFAST(X，N，WNAME) performs the Same 0peration but 
fetches filters LP and HP for Wavelet WNAME USing WAVEFILTER ， 


Scale parameter N must be 1esSs than or 86qual to 10g2 0f the 
maxzimum image dimenslon，FiJIters LP and HP must be even，TO 


Wavefast 
ER 
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reduce border distortion，X isS Symmetrically extended，That is， 
would be {,,， c3 c2 cl1 cl c2 ca .cnocn cn-l cn-2 ...]. 


OUTPUTS， 
Matrix C Is a Coefficient decomposition Vector: 
C=[alnihon)vndni hn-t) ..,，v1) d(1) ] 
where a，h，v，and d are ColumnwiSse vectors Containing 
approximation，horizontal，vertical，and diagonal Coefficient 


matrices，Frespectively. C has 3n + 1 Sections Where n is the 
number of WavelLet decompositions . 
Matrix S 1S an (n+2) X 2 bookkeeping matrix: 

S= [san :); sdln-1，:);，,.,;，Sd(1，:);，SX ] 
where Sa and sd are approximation and detail Size entries， 


See6 al1S0 WAVEBACK and WAVEFILTER， 


oo 


Check the input arguments for reasonab1leness ， 
error(nargchk(3，4，nargin) ) ; 


if nargin == 
if ischar(varargin{f1}) 


[lp，hp] = wavefilter(varargin{f1}+， dd ) 
e]Se 

error( MiSsSing Wavelet name，) ; 
end 


el1Se 

JP = vararginft1l; hp = varargin{21}; 
end 
fl = length(1lp); SX = Size(X)， 


if (ndims(x) -= 2) | (min(SX) < 2) | ~isreal(x) | ~isnumeric(X) 
error( X must be a real，numeric matrix，) ; 
end 


if (ndims(lp) ~= 2) | ~isreal(lp) | ~isnumeric(1lp) .,， 
| (ndims(hp) -= 2) | ~isreal(hp) | ~isnumeric(hp) ,,， 
| (fl -~= length(hp)) | rem(fL1，2) -= 0 
enrror(T LP and HP must be even and equal length real，"”,.， 
huUmerizc filter vectors，.]); 
end 


if -~isreal(n) | -isnumeric(n) | (In< 1) | (n> 10g2(max(SX))) 
error([ AN must be a real Scalar between 1 and ' ,,， 
10g92(max(Size((X)))，]); 
end 





ifX= [cfc2c3..， cn (in 1D)，then its Symmetric extension 








t 
] ， Ss = SX; 


app = double(Xx); 


each decomposition ,,， 

疡 站 站 

Extend the approximation Symmetrically， 
p，Kkeepj】= Symextend(app，f1); 














rows = Symconv(tapp，hp， row' ，fl，keep); 
;0efS = SYmconv(rowS，hp， 'col'，f1L，keep) ， 
= [coefs(:)， Cc]; Ss = [SizefCcoefS); Ss]; 
0efs = SYymconv(rows，]1p， col' ，f1，keep); 
[coefsS(:)” Cc]; 






With HP and LP to get the horizontal and next approximation 
“ Coefficients ， 


oefs = symconv(rows，hp， Col ，Tfl1，keep) ; 
= [coefs(:) CC]) 
pp = SyYmconvy(rowSs，lp， 501 ，Tf1，keep ) ; 


Unction [y，keep] = Symextend(Xx，f1) 
ompute the number of coefficients to Keep after convolution 
nd downsampling，Then extend x in both dimensions. 


eep = floor((f1 + Size(X) - 1) / 2); 
padarray(x，f(f1L ~- 1) (fl1 -1)]，  Symnmetric'， "both' ) ; 


unction Y = Symconv(X，h，type，f1l1，keep) 
Yonvolve the Prows or columns of X With h， downsamp]e ， 
and extract the Center section since Symmetrically extended ， 


Strcmp{type，'row' ) 

= Conv2(X，h) ; 

二 人 hd) 

yl fL/A2+1:fLIA2+Kkeep(2)); 


= Conv2(X，h' ) ; 
y(1:2:end， 1;) 
于 二 本 有 机 和 


二 


上 


) 
fl /2+keep(1)，:); 
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Convolve rows With Hb and downsamp1e，Then convolve columns 
with HP and LP to get the diagonal and vertical coefficients. 


询 
人 
本 


2 


the StartlIng Output data structures and initial approximation， 


3 5 
二 和 
< 
COrIV2 
=Conv2 (A, 日 ) 
Perforrzs [pe 2- 门 
COHYOLNOF OF ad- 
tricegAancdB. 


2D8 


EXAMPELE 7.3: 
Comparing the 
execution times of 
wavefast and 
WavedecCc2. 
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As can be seen in the main routine,only one for ljoop, which cycles thr 
the decomposition levels (or scales) that are generated, is Used to orchest: 
the entire forward transform computation. For each execution of the loo 
current apPIoximation image, app, which is initially set to x, is SyYmmetri 
extended by internal function symextend,This function cajlls padarray,W 
was introduced in Section 3.4.2,to extend app in two dimenslions by mirr 
flecting f1--1ofits elements (the length of the decomposition flter mint 
across its border， 党 

Function symextend returns an extended matrix of apProxlmation coe 诈 
cients and the number of pixels that should be extracted from the center 
any Subsequently convolved and downsampled results. IJhe rows of th 
tended apProximation are next convolved with highpass decomposition 
hp and downsampled via symconv. TIhis function is described ln the follo 
Paragraph. Convolved output, rows,is then submitted to symconv to convo 
and downsample its columns with filters hp and Lp 一 generating the diagg 
and vertical detail coefficients of the top two branches of Fig. 7.2. These Tes 
are inserted into decomposition vector c (working from the jast elemen 
ward the first) and the process js repeated in accordance With Fig. 7.2 to 8 
erate the horizontal detail and approximation coefficients (tbhe bottom 
branches of the figure)， 

Function symconv uses the conv2 function to do the bujk of the transfao 
computation work. It convolves filter h with the rows or columns of x 
pending on typej, discards the even indexed TOwWS or columns (te., doOwnsaI 
pies by 2), and extracts the center keep elements of each Iow of colu 
Invoking conv2 with matrix x and row filter vector h initiates a TOW-by 
convohution: using column filter vector hn ' results in a columnwise convoluti 


S2] = 
十 OC | 


Wavefast(f，n，wname ) 


mpare the FeSsults ， 
0O=t2 1/ (reftimne + eps) 
diff = abs(max(cf 一 C2) ) ; 










廿 The following test routine uses functions tic and toc to compare the 6 f= imnread('Vase'， tif ); 


Wavefastt: 


jaxdifference = 
:3.2969e~012 


function [ratio，maxdiff] = fwtcompareftfT，n，wnanme ) 

%FWTCOMPARE Compafre Wavedec2 and wavefast， 
[RATIO，MAXDIFF] = FWTCOMPARE(F，N，WNAME) compares the 0perati 
of toolLlbox function WAVEDEC2 and custom function WAVEFAST， 


FE Image to be transformed ， 
N Number of Scales to0 Compute ， 


5 
站 
和 TINPUTS: 
区 
5 内 NAME Wavelet to USe ， 
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FICGURE 7.4 

A5l2 X 3Sl2 
Image of a vase， 


t transform anhnd computation tine for wavefast ， 


he 512 X 512 image of Fig. 7.4 and a five-scale wavelet transform with 
espect to 4th order Daubechies” wavelets, fwtcompare ylelds 


[ratio， maxdifference] = fwtcompare(f，5， 'db4') 


golbox counterpart while Producing virtuaHy identical resujts. 剖 


5 OUTPUTS : 
各 RATIO 
折 MAXDIFF 
上 5 Get transform and computation time for wavedec2 ， 
下 了 

[c1，sS1] = wavedec2(f，n，wname ) ; 

PFeftlme = 七 DCj 


Execution time ratio (CuUStom/tooLbox ) 
Maximum Coefficient difference ， 


he wavejet transformation functions of the previous two sections produce 
ondispiayapie data structures of the form {cS$},， where cis atransform coeft- 
clentt vector and S js a bookkeeping matrix that defines the arrangement of 
efficients in c.To process images, we must be abje to examine and/or modify 
this section, we formally define {cy S$},examine some of the Wavejlet Iool- 
functions for manipulating it, and develop a set of custom functions that 
be used without the Wavelet Toolbox. These functions are then used to 
ld a general purpose routine for displaying c. 
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The representation Scheme introduced in Example 7.2 integrates the ct 
cients of a Immultiscale two-dimensional wavelet transform into a Singl 
dimensional vector 


c=[ANG)” HxO 丽人 ”VD Di (CD Di(: 


where AN ails the approximation coefficient matrix of the Nth decompos 
level and Hu, Vand Di for = 1 2,,..Nare the horizontal， Vertical an 志 
agonal transform coefficient matrices for level z_ Here, Hi 人 for examp 


the row vector formed by concatenating the transposed columns of matr ox = appcoef2(c1，s1，'haar') 


That 1s, 下 
60.0000 
邢 ; = 1 46 rizdet2 = detcoef2('h'，cl1，s1，2) 

,0e-013 * 

3 0 -0.2842 

] O 0 

了 Hi() = 9 and Hi( =[3 1 一? 5 ewc{1 = WwWthcoef2( h' ，c1，s1，2) ; 
ewhorizdet2 = detcoef2('h'，newcft，s1，2) 


Orizdet2 = 
0 0 


Because the equation for c assumes N decompositions (or passes through 0 0 


filter bank in Fig.7.2),e contains 3N + 1 sections 一 one approximation and 
grOouUPs of horizontal, vertical, and diagonal details. Note that the highest S 
coeftftlcients are coOmputed when i = 1i; the lowest scale coefficients are ass 
ated withz = NA.Thus,the coefficients of e are ordered 和 om low to high scal 旬 

Matrix $ of the decomposition structure is an (N + 2) X 2 bookkeep 训 | 
array of the form : 

















Te a three-level decomposition with respect to Haar wavelets is performed 
an8 X 8magic sgquare using the wavedec2 function. The resulting coeffi- 
enht Vector c1,is of size 1] X 64. Since s1 is S X 2, we know that the coeffi- 
ts of c1 span (W-2)=(3 一 2)=3 decomposition levels Thus, 让 
ncatenates the elements needed to populate 3N + 1 = 3(3) +1T = 10ap- 
ximation and detail coefficient submatrices. Based on s1, these submatri- 
cude (aj) al X 1 approximation matrix and three 1 X 1 detail matrices 
composition level3 [see s1(1，:) and s1(2，:)],(b)three2 X 2 detail 
rlces for level2 [see s1(3，: )],and (c) three4 X 4 detail matrices for level 
See S1(4，: )].The fifth row of s1 contains the size of the original image ff， 
atriX approx =260 is extracted from c1 using toolbox function appcoef2， 
h has the following syntax: 


SS =|saw; Sdnw; sd -1 … sd …， sdj; sf 


whbere SaN,sSdqi, andsfare1l X 2vectors containing the horizontal and ver 
dimensions of Nth-level approximation ANV, ;th-level details (Bu Vi and. 
forz = 1,2,...W),and original image ,respectively The information in Sec: 
be used to locate the individual approximation and detail coefficients in: 
Note that the semicolons in the preceding equation indicate that the elemen 
of 9 are Organized as a column vector. 






hic 












a = appcoef2(c，S，wname) appcoef2 
EXAMPLE 7.4: 疆 TIhe Wavelet Toolbox provides a variety of functions for locating, eXtrat 1 


Wavejet IJoolbox 。 ing,reformatting, and/or manipulating the approximation and horizontal， ere, Wname is a wavelet name from Table7.1and aisthe returmed approxi- 


。 tical， and diagonal coeffhcients ofcasa functlon of decomposition level. We Jal on matrIX. The horizontal detail coefficients at level 2 are retrieved using 

人 Introduce them here to illustrate the concepts just discussed and to PIep3I detcoef2， a function of similar syntax 

decomposition the way for the alternative functions that will be developed in the next secti 

Vector c. Consider, for example, the following sequence of commands: d = detcoef2(0，c，s，m ooefa 


>> 和 = magic(8) | 
>> [cf1，S1] = Wavedec2(f，3，'haar')， 
>> Sizel(cl) 


FEWhich o is set to hv，or 'd' forthe horizontal， vertical and diagonal 
etalls and n is the desired decomposition level, In this example,2 X 2 matrix 
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浊 汪 wthcoef2 


Wavework 


站 0 iotv mrnweroeroanerpat ev yoiyw 


horizdet2 1S returned， The coefficients SOITesponding to horizdet2 in ci 
then zeroed Using Wthcoef2,a wavelet thresholding function of the form: 


nc = wWthcoef2(type，c， SO 


.3.1 Editing Wavelet Decomposition Coefficients 
without the Wavelet Toolbox 


Without the Wavelet Toolbox， bookkeeping matrix S is the key to acce 
the Indlvidual approximation and detaijl coefficients of multiscale vector 
thls section, we use S to build a set of Seneral-purpose routines for the m 
ujation of ec. Function wavework is the foundation of the routines devel 
whlch are based on the familiar cutrcopy-paste metaphor of modern word 
cessing applications. 


function [varargout] = Wavework(opcode ， 七 
Ype，C，S，n，xX 
5WAVEWORK is used to edit Wavelet 人 


和 5 [VARARGOUT] = WAVEWORK(OPCODE ， TYPE，C，S，N，X) gets the 

5 coefficients specified by TYPE and N for a jj On - 
CCes 

ee S or modification : 

和 5 

%5 INPUTS: 

站 OPCODE 0peration to perfornm 

5 COPY' [varargout ] =Y = requested (via TYPE and N) 

% Coefficient matFix 

CUt [varargout] = [NC，Y] = New decomposition Vector 

(with requested Coefficient matrix zeroed) AND 

5% requested Coefficient matphix 

5 paste [varargout] = [NC] = new decomposition vector with 

% Coefficient matrix replaced by X 

和 5 

和 TYPE Coefficient category 

先 ， Approxinmation coefficientg 

移 四 Horizontal details 

和 5 CEV Vertical details 

到 :dl Diagonal details 

站 

揪 [4，S] ls a wavelet toolbox decomposition stFucture， 
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is 3a decomposition level (Ignored if TYPFE =  a )， 
is a two-dimensional coefficient matrix for pasting， 


ee al1S0 WAVECUT，WAVECOPY，and WAVEPASTE , 
{nargchk(4， 6，mnargin) ) ; 


dins{c) -= 2) | (size(c，1) -= 1) 
ror( Cnmust be 3a _ row Vector， ) ; 


dims{(S) -= 2) | ~isreal(S) | ~isnumeric(S) | (Size(S，2) ~= 2) 
ror('S must be a real，numeric two-COlumn array，) | 


= prod(S，2) ; % Coefficient matrix elementS ， 


(Cj < elementSs(end)) | ,,， 
(elementSs(1) + 3 * Sum(elements(2:end - 1)) >= elementSs(end ) ) 


or(f' [CS] must form a standard wavelet decomposition 和 
' Structure，]) |; 


argin < 5 
= 十 和 Default 1Level 1Ss 1 ， 


生 Maximu Levels in [C，93]， 


= (]ower(type(1)) == 'a ); 
aflag & (n0 > nmax) 
Pror('N exceeds the decompositions in [C，S]，); 


% Make pointers Into C， 


= 十; stop = elementSs{(1); ntst = nmax ; 
Vi， d} 

wltch type 
 ，offset = 0 % 0ffset to details 


YV'，offset = 


| 
ee 


2 
(mm 
P 
全 
人 
下 
- 趟 
nn 
CD 
人 一 
中 
f) 


= Sl1zZe6(S，1) 一 % Index to detail info， 
= elements(1) + 3 * Sum(elementSs(2:nmax 一 + 1)) + ，，， 
offset * elLements(nlndex) + 1 ; 
start + elements(nindex) ~ 1 
= 


error( TYPE must begjin With "ah VOor dd， 


203 
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WavecUt 


二 
& 


Switch Jower(0pcode) 


casS 


Cas 


oth 


end 


As Wavework checks its input arguments for reasonableness, the number 
elements in each_ coefficient submatrix of c is computed Via element 
prod(s，2). Recall from Section 3.4.2 that MATI ABR function Y = pr 
DIM) computes the products of the elements ofX along dimension DIM.Th 
Switch statement then begins the computation of a pair of Pointers to the 
efficients associated with input parame 
case (ie case 'a'),the computation i 
at the start of c (so pointer start is Ji the ending index， pointer stop， 
number of elements in the appPTOoximation matrix, which is elements (1).W 
a detail coefficient submatrix is Tequested, however, start is compnute 
SUmming the number of elements at all decomposition levels above n 训 
adding offSset * elements(nindex); where offset js 0, 1,or 2 for the hg 
ZzZontal, vertical or diagonal coefficients， reSpectivejl 
to the row of s that corresponds to input parameter n. 

Tbe Second switch statement in function wavework performs the ope 
tion requested by opcode. Forthe 'cut' and Copy ' cases, the coefficients 
c between start and stop are copled into y, which has been Preallocated 
two-dimensional matrix whose size is determined by s. Jhis is done using 


昌 = 


The following three functions 一 wavecut, wavecopy, and Wavepaste 一 
Wavework to manipulate c using a more intuitive Syntax: 


吉 DO redquested action， 
e { copy'，'cut'} 
yY = repmat(0，s(nindex，:) ); 
y(:) = cf(start:stop)， nc = 
1f Strcmp(Jower(opcode(1:3))，'cut， 
nctstart:stop) = 0; varargout = 
el1Se 
Varargout = {YT} 


cy; 

) 
{nc，yY}; 
enad 


Be “paste， 
if prod(size(xX)) -= elements(end 一 ntst) 


error( X is not Sized for the requested paste， ) 


el1Se 

nC = C; 
end 
erw1lse 
error( Unrecognized 0PCODE，: ) ， 


ncf(Start:Stop) = X( 1:)， 


function [nc，y] = wavecut (type，C，S，D) 


5WAVECUT Zeroes coefficients in a wav 
[NC，Y] = WAVECUT(TYPE，C， S，N) returns a new decomposition 
Vector whose detail or approximation coefficients (based on TY 


总 


务 
抱 


】》 


varargout = {fncl; 


ters type and n. For the approxima 
S triVvial since the coefficients are al] 


Yand nindex is a po 


elet decomposition structure. 
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and N) have been zeroed， The coefficients that were zeroed are 
returned in Y， 


“TYPE Coefficient category 
8， Approximation coefficients 
:hh Horizontal detail1s 

LV Vertical detail1s 
Diagonal detailSs 


[C，3S]j ls a Wavelet data structure . 
N Specifies a decomposition level (1ignored if TYPE = 'a')， 


See als0 WAVEWORK，WAVECOPY，and WAVEPASTE. 


pr(nargchk(3，4，nargin) ) ; 
argin == 4 
nc，y] = Wavework('cut'，type，C，s， n) ; 


Wavework( 'cut'，type，C，S)， 


志 
CC2? 
< 必 
1 

自 


| 


Wavecopy(type，c，S，Dn) 
FCOPY Fetches coefficients of a Wavelet decomposition structure， 
Y = WAVECOPY(TYPE，C，S，N) returns a Coefficient array based on 


人 -十 
术 、 
心 
亏 
用 


wavecopy 
3 


_TYPE coefficient category 
站 Approximation coefficients 
hh Horizontal details 
2 Vertical detail1s 
dd Diagonal details 


[0C，8] is a wavelet data structure， 
清 Specifies 8 decomposition level (ignored if TYPE = a )， 


pf(nargchk(3，4， nargin) ) ， 
argin == 4 
YY = WavewWork( 'copyY ' ， type，C，S，n)，; 


y = Wavework( copy' ，type， C，S); 


Wavepaste 
将 





- EPASTE Puts coefficients in a wavelet decomposition Structure ， 
= WAVEPASTE(TYPE，C， S，N，X) returns the new decomposition 
structure after pasting X into it based on TYPE and N.， 
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玫 XAMBPLE 7.5: 
Manipujating“e 
with wavecut and 
WaveCcopy. 


cocoooopospoosoeduodouo do de 














[C，9S] 1Ss a Wavelet data Structure ， 

N specifies a decomposition level (Ignored jif TYPE = 'a') 

X is a two-dimensional approximation or detail coefficien 
matrix whose dimensions are appropriate for decomposition 
evel N. 


See 3alSO WAVEWORK，WAVECUT ，and WAVECOPY， 


error(nargchk(5，5，narginy) ) 
nc = Wavework( paste' ，type，C，S，n，X)，; 


滥 Functions wavecopy and wavecut can be used to Ieproduce the WW 
Joolbox based results of Example 7.4: 


>> ff = magic(8); 


>> [c1，S1] = Wavedec2(f，3，'haar')) 


>> approx = Wavecopy( 'a'，c1，Ss1) 0 or 1 
approx = AS 
260.0000 汪 2 
>> horlzdet2 = Wavecopy('h'，cf，sS1，2) 
horizdet2 = 0 
1.0e-013 * RE 
0 ” -0.2842 append， 
0 O 
>> [newc+，horizdet2]】 = wavecut('h'，c1，s1，2); Image W: 


>> newhorizdet2 = Wavecopy( 'h' ，newcft，sS1，2) 
newhorizdet2 = 

0 0 

0 


Note that all extracted matrices are ldentical to those of the Previg 攻 
example. 





;zz Displaying Wavelet Decomposition Coefficients 


As was indicated at the start of Section 7.3, the coefficients that are Pa 
into one-dimensional wavelet decomposition vector c are,in reality,the c0 
cients of the two-dimensional output arrays from the filter bank in Fig.7.2 
each literation of the filter bank,four quarter-Size coefficient arrays (negle: 
any expansion that may result from the convolution process) are Produ 人 8 


WAVE2GRAY Disp1lay Wavelet decomposition Coefficients， 
iEE WAVE2GRAY(C，S，SCALE，BORDER) displays and returns a 
Wavelet coefficient image， 


71.3 得 Working with Wavelet Decomposition Structures 


an be arranged as a2 X2array of submatrices that replace the two- 


Wave2gray(C，5) ; Display W/defaults . 
foo = Wave2gray(Cc，5) | Dispjlay and return， 

foo = Wave2gray(Cc，S，4) | 

foo = Wave2gray(Cc，S，-4) | Magnify absolute Values， 
foo = Wave2gray(Cc，S，1， 


Magnify the details， 


append ) ; Keep border values 


Detall Coefficient SCcaling 


MaxImum range (default) 
Magnify default by the Scale factor 
Magnify absolute Values by abs(Scale) 


Border between Wavelet decompositions 


Border replaces jimage (defaujt) 
Border Increases Width of image 


aln) hn ) 
2 htn-1) | 
| vtm ad) | | 
| von-) | don-) | 
v{n-2) dtn-2) 


207 


0， ensional Input from which they are derived.Function wave2gray performas 
人 Rs ibimage compositing 一 and both scales the coefficients to better reveal 
人 ifferences anid inserts borders to delineate the approximation and vari- 
Re TiZontal， vertical, and diagonal detail matrices. 
人 Vertical detaills 
计 Diagonal details ion W = Wave2gray(C，S，Scale，border) 0 


人 
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Here，n denotes the decomposition step scale and a，h，v，{ a 


= Wavecopy( vV ，cd，S，IL); 
approximation，horizontal，vertical，and diagonal detail 


ad = WS -SizeflV); frontporch = round(pad / 2)， 


oO oo oo 


CoefficientSs，freSspectively， 


s% Check input arguments for reasonablLeness， 
error(tnargchk(2，4，nargin) ) ; 
if (ndimns(c) ~= 2) | (Size(c，1) -= 1) 

error( CC must be a row vector.'); end 


f (ndims(S) -= 2) | ~isreal(S) | ~isnumeric(S) | (size(sS，2) -= 2) 


error( S must be a real，nunmeric two-column array，) 1; end 


eJLements = prod(S，2); 
if (length(c) < elements{end)) | .,， 


~(elements(1) + 3 * Sum(elements(2:end - 1)) >= elements(end)) - 


error([ [IC 3S] must be a Standard Wavelet ' ,,， 
-decomposition Structure. 1]); 
end 


If (nargin > 2) & (~isreal(scale) | ~isnumeric(scale)) 
error( SCALE must be a real，numeric Scalar.:); 

end 

if {nargin > 3) & (~ischar(border) ) 
error( BORDER must be Character String.'); 


end 
if nargin == 2 
Scale= 1; 55 Default scale， 
end 
if nargin < 4 
border = "absorb'; 点 Default border. 


end 


s5 Scale coefficients and determine pad fill， 
absfJlag = Scale < 0 
Scale = abs(Scale); 
If Scale == 
SCale = 1 ; 
end 


[cd，w] = wavecut( 'a'  ，Cc，s); WwW = mat2gray(w) ; 
cdx = max(abs(cd(:))) / scale; 
if absf1lag 

cd = mat2gray(abs(cd)，[0，cdx]); fil1l = 0， 
elTSe 

cd = mat2gray(cd，[-cdx，cdx]); fil1 = 昌 s 
end 


s5 Build gray image one decomposition at a time. 
for = Size(S，1) -- 2:-1:1 
WS = Size(w); 


= Wavecopy('h'，cd，s，); 
d = WSs 一 Sizelhy); 
= padarray(h，frontporch，fil11， :pre'); 


h 
pa 
h 
h = padarray(h，pad -~ frontporch，Tfil1， "post'); 























frontporch = roundf(pad / 2); 


= padarray(V，frontporch，Tfill， pre ) | 

= padarray(vV，pad - frontporch，fil1l， post ' ) ; 
:d = Wavecopy('d'，cd，s，i)， 

:pad = WS 一 Sizel(d); frontporch = round(pad / 2)， 
dd = padarray(d，frontporch，fil1l， 'pre' ) ; 
d=padarray(d，pad - frontporch， fi11， "post ' ) ， 
各 Add 1 plIxel white border， 

Switch 1 0wer(border) 

Case append ' 

WwW = padarray(w，[1 1]，1， "post')， 

h = padarray(h，[1 0]，1，'post'); 

v = padarray(v，[0 1]，1， 'post':)， 

casSe “abSorb， 


WwW(:，end) = 1 w(end，:) = 1 
h(end，:) = 1T; vV(:，end) = 1]; 
otherwise 


error( Unrecognized BORDER parameter ，) ; 
end 


w= [whyv dl]; % Concatenate Coefs. 


nargout == 0 


Imshow(w) ; 名 DisplLay resulLt， 


The “help text”or header section of wave2gray details the structure of gen- 
iated output image w.The subimage in the upper left corner of w, for instance， 
he approximation array that results from the final decomposition step. It is 
ounded 一 in a clockwise manner 一 by the horizontal， diagonal, and vertical 
ail coefficients that were generated during the same decomposition.The re- 
nlting 2 X2 arTay of Subimages is then surrounded (again in a clockwise 
anner) by the detail coefficients of the Prevlous decomposition step; and 
-Pattern continues until al of the scales of decomposition vector c are 
nded to two-dimensional matrix w. 
e compositing just described takes place within the onjy for loop in 
8<gray.Atter checking the inputs for consistency wavecut is called to re- 
ye the approximation coefficients from decomposition vector c.These coe 仁 - 
ts are then scaled for later display using mat2gray. Modified decomposition 
tor cd (ie.,c without the appIoximation coefficients) is then similarly scaled. 
positive values of input scale,the detail coefficients are scaled so that a co- 
ficient value of 0 appears as middle gray; all necessary padding is performed 
和 ha fil1 value of 0.5 (mid-gray). If scale is negative, the absolute values of 
detall coefficients are displayed with a value of0 corresponding to black and 
pad fil1l value is set to 0. After the approximation and detail coefcients 
been scaled for display, the first iteration of the for loop extracts the last 
ReCOmposition step's detail coefficients from cd and appends them to w (after 
dding to make the dimensions of the four Subimages match and insertion of a 
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下 叉 AMPLE 7.6: 
Transform 
coefficient disPiay 
Usling wave2gray. 


3 
b :c: 
FICURE 7.5 
Displaying a two- 
Scale wavejet 
transform of the 
Image In 上 18g, 7.4: 
(a) Automatic 
Scaling; 
(b) additional 
scaling by 8; and 
(c) absolute 


values Scaled by 8. 
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， . ing along the borders of the level 1 coefficient Subimages: it was insert- 
one-pixel white border) viathew= [whjiyvd] statement. TIhls Process is then 世 ding 8 8 


peated for each scale in c.Note the use of wavecopy to extract the VarIlous detal 


oefficients needed to form w 6Ss. FigurIe 7.3(c) Shows the effect of taking the absolute values of the 
E 


atails. Here, all padding is done in black. 罗 


跨 The following sequence of commands coOmputes the two-Scale DWT of th 
image in Fig. 7.4 with respect to fourth-order Daubechlies wavelets and di 
Plays the resujlting coeffticients: 


有 The Inverse East Wavelet Iransform 


e its forward counterpart, the inverse josl ayvelel rra1zsjorpl can be com- 
>>f= imread( "vase,tif ) ) ted iteratively using diglital flters. Figure 7.6 Shows the required yymtpesis OF 
>> fc，s] = wavefast(f，2， O15S1rUCHiO Jier pdaK, which revyerses the Process of the analysis or decom- 
>> Wave2gray(c，sS); sition flter bank of Fig. 7.2. At each iteration, four scale j approximation 
>> figurej Wave2gray(Cc，S，8) d detall Subimages are Psamzpled (by inserting zeroes between every 
>> figurej Wave2gray(Cc，S， 一 8 ment) and convolved with two one-dimension filters 一 one operating on the 
iniages ” columns and the other on lts TOws. Addition of the results yields 
:Scale ) 二 1 apProxlmation, and the Process is repeated until the original 
ge IsS reconstructed. Ihe filters used in the convolutions are a function of 
he wavelets employed in the forward transform. Recall that they can be ob- 
ined from the wfilters and wavefilter functions of Section 7.2 with input 
araImeter type set to “Pr ' for "reconstruction.” 

When using the Wavelet Toolbox, function waverec2 is employed to compute 
Inverse 上 WT of wavelet decomposition Structure [C, S].Itis invoked using 


The images generated by the 





g = Waverec2(C，S，wnanme ) 
here g ls the resulting Teconstructed two-dimensional image (of class doub1e). 
hie Tequlred reconstruction filters can be alternately Supplied via syntax 


g = Waverec2(C，S，Lo R，Hi _R) 


olowing custom routine, which we call waveback, can be used when the 
yejlet Toolbox is unavailable. It is the final function needed to compilete our 
ayelet-based package for Processing images in conjunction with IPT (and 


fthout the Wavelet Toolbox). 


Rows 
(along 7) (+ Hz) 
Columns 
Ca (Gong 
Rows (人 到 (十 157,0) 
Wo 
Rows 


人 
Columns 


0 


1.4 壮 The Inverse East Wavelet Transform 


证 
放 


二 加 Waverec2 


FIGURE 7.6 The 
2-D FWT” filter 
bank. The boxes 
with the up 
afrrOwWS TepPIesent 
upsamplng by 
inserting ZeroOes 
between everIy 
element， 


2 


2 


WavebackK 


tt 罗 hareuayrae 六 轴 放 Recet 


Chapier 7 器 Wavelets 


function [varargout] = waveback(c，S，varargin) 

2%WAVEBACK Performs a multi-1evel two-dimensional inverse FWT. 
[VARARGOUT] = WAVEBACK(C，S，VARARGIN) computes a 20 N-1Level 
partial OF complete Wavelet reconstruction of decomposition 
structure [C，S]， 


SYNTAX: 
Y = WAVEBACK{(C，S，'WNAME ' ) ; 
Y = WAVEBACK(C，S，LR，HR) ， 


output inverse FWT matrix Y 
USing Lowpass and highpass- 


HR) or filters obtained by 


[NC，NS] Output new Wavelet 


[NC，NS] 


WAVEBACK(C，S， 'WNAME' ，N) ; 
WAVEBACK(C，S，LR，HR，N) ; 


所 


[NC，NS] after N Ste 
reconstruction. 


祖 oo ybo 


See also WAVEFAST and WAVEFILTER. 


Check the input and output arguments for reasonab1leness， 
error(nargchk(3，5，nargin) ) ， 
error(nargchk(1，2，nargout ) ) ; 


if (ndims(C) ~= 2) | (size(c，1) -= 1) 
error( C must be a row vector.:)， 
end 


if (ndims(S) -= 2) | -isreal(s) | -~isnumeric(s) | (Size(s，2) ~= 2) 
error( 3S must be a real，numeric two-column arFray， ) ; 
engd 


eJements = pfrod(S，2); 
if (length(c) < elements{end) ) 


~(elements(1) + 3 * Sum(elements(2:end - 1)) >= elements(end)) 


error([ [IC S] must be a standard wavelet : .，.， 
decomposition Structure.'])， 
end 


5 Maximum 1evels in fC，9S]. 
nmax = Size(S，1) 一 2; 
光 Get third input parameter and init check fl1ags， 


wnanme = Varargin{tj; filterchk = 0;) nchk = 0; 
SWitch nargin 
case 3 
if ischar(wname ) 
[lp，hp] = wavefilter(wnane，'F); mn= nmax ; 
el1Se 
error( Undefined filter，)， 
end 


if nargout ~= 1 





















reconstruction filters (ER and 


calling WAVEFILTER with 'WNAWME: 


decomposition Structur 


7.4 器 The Inverse East Wavelet Transform 


-error( Wrong number of output arguments.') ， 


[lp，hp]} = wavefilter(wname，'r'); 
ns=yvarargin{f2}; nchk = 1; 
e156 


lp = Varargin{t1}+; hp = varargin{f21; 
filterchk = 1 nn = nmax ; 
if nargout -~= 1 
error( Wrong number of output arguments.') ; 
end 


= Yararginf1} hp = varargin{2}， filterchk = 二; 
= Varargin{f3)}; nchk = 1 ; 
therwise 


rror( Improper number of input arguments.)， 


_ length(1p) ; 
ilterchk 移 Check filters， 
(ndims(Ip) ~= 2) | -isreal(lp) | ~iSnumeric(1p) .,. 
| (ndims(hp) ~= 2) ~isreal(hp) | ~isnumeric(hp) .,， 
| (f1 ~= length(hp)) | rem(f1，2) -= 0 
error(f LP and HP must be even and equal length real，' ,,， 
numeric filter vectors.']); 


1 


nchk & (~isnumeric(n) | -isreal(n)) % Check scale N， 


error('N must be a real numeric,')， 


nn> nmax) | (n < 1) 
s error( Invalid number (N) of reconstructions requested， ) ; 


On ~= nmax) & (nargout -~= 2) 
error( Not enough output arguments.' )， 


芭 : nsS = S; nnmax = 
『 工 = 4 
了 二 a new approximation， 
= Symconvup (Wavecopy( 'a' ，nc， 
Symconvyup (wavecopy( h' ，nc，ns，nnmax)，. ,， 
hp，1lp，f1，ns(3，:)) + ，， 
Symconyup (Wavecopy( vV' ，nc，ns， nnmax)，，,,， 
1p， hp， f]， ns(3， : ) ) + ，， 
Symconvup (Wavecopy( 'd' ，nc，ns， nnmax)，.，,， 
hp，hp，f1，ns(3，2:) )) 


nmax ; 各 Init decomposition， 


nsS)，jJp，1p，fl，ns(3，:)) + ,,， 


7 
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s% Update decomposition， 
nc = ncl(4* prod(ns(1，:)) + 1:end) nC 
ns = ns(3:end， 1:) 帮 S 
nnmax = Size(fnSs，1) 一 2; 

end 


fa(:) ncli 
[ns(1，?:) 


有 


ion of the test function in Example 7.3: 


ction [ratio，maxdiff]j = ifwtcompare(f，n，wname) 


% For complete reconstructions， reformat output as 2-D， COMPARE Compare waverec2 and waveback， 


]f 9 == 四 本 RATIO，MAXDIFF] = IFWTCOMPARE(F，N，WNAME) compares the 

a=ncj nc=repmat(0, ns 3)) nc) = ai peration of Wavelet Toolbox function WAVEREC2 and custom function 
end AVEBACK， 
varargout{1t = nc 8 
if nargout == NPUTS 

varargout{2} = ns; 下 Image to transform and inverse transform、 
2 AN Number of Scales to Compute， 
CR _WNAME Wavelet to USe， 


function z = Symconvup(x，f1，f2，fln，Kkeep) 
% Upsample rows and convolLve columns With ff+;，upsample columns and 


碎 


.RATIO Execution time ratio (Custom/toolbox) ， 


% convolve rows With f2，then extract Center assuming SymmetrIcal MAXDIFF ， | 
% extension， Maximum generated image difference， 
ompute the transform and get outpu 
Y = zeros([2 1] .* Size(Xx) ) ; yY(1:2:end， :) = X; 0 9 tput and computation time for 
Y = COonv2(yY，1f1 ) ; 人 
。 二 e 和 
z = zeros([1 2] .* Size(y) ); z(:，13:2:end) = yj 加 
ee waverec2(ct，8s1，wname ) ; 


= zlfln- 1:fln+keep(1) 一 2，fjln 一 1:fln + keep{2) 一 2) 和 二 : 浊 站 
了 


te the transform and get output and computation time for 
acCK， 


S2] = Wavefast ( 千 ， 站 ，Wnanme ) ; 


The main routine of function waveback is a Simple for loop that iter 
through the requested number of decomposition levels (i.e., scales) in thed 
sired _ reconstruction。 As can be Seen，each loop calls internal functlg 
symconvup four times and Sumas the returned matrices. Decomposlition Vet 
nc, which is initially set to c,is iteratively updated by replacing the four coef 
cient matrices passed to symconvup by the newly created appIoximatigo 
Bookkeeping matrix ns is then modified accordingly 一 there is now one 让 
scale in _ decomposition structure [nc，ns]. This sequence of Operatlong: 
slightly different than the ones outlined in Fig.7.6,in which the top two In 忆 
are combined to yield 


[YY 7171， 后 # 户 凡 DT) 十 厂 %(， 111， 1) 个 全 六 彤 (AT 出 及 所 用 有 


where 人 122 and 122 denote upsampling along ma and m, TesSpectively Functg 
waveback uses the equivalent computation 


[YY2C 站 全 见 (四 宇 lm 十 [至多 (让 mm 站 全 he 人 mm)] 个 


Function symconvup performs the convolutions and upsampjing require 
compnute the contribution of one input of Fig. 7.0 to output 到 oU + 工 Pa 站 ) 
cordance with the proceding equation. Input x is first upsampled in the TOV de 
tion to yield y, which is convolved columnwise with filter f1.The resujlting ou 刀 
which replaces y,is then upsampled in the column directon and convolved IO 
row with f2 to produce z. Finally,the center keep elements of z (the final co 


iution) are returned as input xs contribution to the Dew approximation. 


a tve scale transform of the 912 X 512 Image in Flg. 7.4 with respect to 4th-order 
0bechies” wavelets, we get 


Patio， maxdifference] = ifwtcompareftfT，5， db4 |) 


3.6948e-013 


te that the inverse transformation times of the two functions are equijvalent 
he Tatio is 1) andthat the largest output difference is 3.6948 X 10-53. For 
Iactical puUrposes,they generate identical resujts in identical times. 派 


e following test routine compares the execution tiimes of Wavelet Tool- 
名 nction waverec2 and custom function waveback using a simple modifi- 
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EXAMPLE 7.7: 
Comparing the 
execution times of 
waveback and 
Waveprec2. 
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了 上 XAMDPLE 7.8: 
Wavelet 
directionality and 
edge detection. 


下 上 入 AMPLE 7.9: 
Wavejlet-based 
Image Smoothing 
or blurring. 
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Wavelets in Image Processing 


As in the Fourier domain (see Section 4.3.2), the basic approach to wavyel 
based Image PITocessing ls to 


人 


二 


1。Compnute the two-dimensional wavelet transform of an image. 
2 Alter the transform coefficients. 


3 Compnute the Inverse transform 
“at 中 


4 ]] 
0OO (人 聊 : 人 
咱 攻 一 


入 扩 和 入 引 和 二 


Because scale in the wavelet domain is analogous to frequency in the Fou 
domain, most of the Fourierbased filtering techniques of Chapter 4 hav 
equjvalent“wavelet domain "counterpart, In this section, we use the Preced 
three-step procedure to give several examples of the use of wavelets in imi RN 人 | 
PTrocesslng. Attention is restricted to the routines developed earjlier in: 0 罗 (下 
chapter; the Wavejlet IJoolbox is not needed to implement the examples 也 
here 一 nor the examples in Chapter 7 of Digitael 17paage Processizzg (Gonz 
and Woods [2002]). 


野 Consider the 500 X 500 test image in Fig. 7.7(a). This image was usedi 人 和 
Chapter 4 to jlustrate smoothing and sharpening with Fourier transfor 引 区 8 
Here, we use it to demonstrate the directional sensitivity of the 2-D wa ee 
transform and its usefulness in edge detection: 


>> 咎 = imread('A.tif') ; 


>> 1lImSshow( 了 f) ; 人 5 
7 5 


>> [Cc，S] = Wavefast(f，1， 'Synm4' ) |; 瑟 
>> figure;i wave2gray(Cc，s，-6)， 
>> [nc，y] = Wavecut( 'a' ，c，sS) | 


站 | 
有 几 由 下 和 
入 人 凯 宙 过 攻 攻 《w 站 人 


>> figure; wave2gray(nc，Ss，-6) |; 
>> edges = abs(Waveback(nc，S，'Synm4') ) ; 
>> figure; imshow(mat2gray(edges) ) ; 

der symlets j 18 Shown in Fig. 7.8(b), where it is clear that a four-scale decom- 


0 ition pas been performed.To streamjline the Smoothing process, we employ 
lowing utijity function: 


The horizontal，vertical，and diagonal directionality of the singjle-S 
wavejet transform of Fig. 7.7(a) with respect to 'sym4' wavelets is clearl 
ible in Fig. 7.7(b). Note, for example, that the horizontal edges of the ori8 于 
image are present in the horizontal detail coefficients of the upper-right qu 六 
rant of Fig.7.7(b).The vertical edges of the image can be similarly identified 
the vertical detail coefficients of the ljower-left quadrant. To combine this 世 
formation into a single edge image, we simply zero the approximation co 
clients of the generated transform, compnite its inverse, and take the abs 
value. TIhe modified transform and resujlting edge image are showrn 


ction [nc，g8] = wavezerofc， S，]，Wnanme ) 

REzERO Zer0eSs Wavelet transform 上 全 和 漳 Coefficients ， 

[NC，G8] = WAVEZERO(C，S，L，WNAME) zeroes the level L detail 
coefficients in wavelet iiGS Oh Structure [C，S] and 
computes the resulting inverse transform With reSpect to WNAME 
avejlets， 


Rigs.7.7(c) and (dj,respectively A similar procedure can be used to isolat foo] = wavecut('h'，c，s，1); 
vertical or horizontal edges alone. foo] = wavecut(v' ，nc，s，1)， 
fo00] = wavecut('d'，nc，s，1); 


加 Wavelets，like their Fourier counterparts，are_ effective instrumentfsS 
smoothing or blurring images. Consider again the test image of 肌 g. 7. 丰 
whblich ls repeated in Fig. 7.8(a). Its wavejlet transform with respect to fo 下 





Waveback{(nc， S，Wname ) ; 
im2uint8{(mat2gray{Ii 工 ) ) ; 


Wure; imshow(g8); 
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二 
:2 计 


FICURE 7.7 
Wavelets in edge 
detection: 
(ajASimple test 
imagei (by) its 
waveiet 
transformi (c) the 
transform 
modified by 
ZeTOoing ajl 
appIoxlimation 
coefficlents: and 
(d) the edge 
image resulting 
from computing 
the absolute value 
of the inverse 
transforrm. 


WavezBifo 
和 
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sing Wavezero, a Series of increasingly Smoothed versions of Fig. 7.8(a) 





e generated with the following sequence of commands: 
c 寺 

e 法 

FIGURE 7.8 = imread( Atif ) 


iC，S]】 = Wavefast(f，4， Synm4 ' ) ; 
image Smoothing: Wave2gray(C，S，20) 

(aj) Atest Image; es 2 8 人 3 [CC，g8] = Wavezerotc，S，1， Sym4 ) 
(bj its wavelet 人 RCR | woCO，g8] = Wavezero(Cc，S，2， 'Symn4 ') 
transformi (Cc) thbe 和 人 人 ffC，g8] = Wavezero(Cc，S，3，'Sym4:) 
四 四国 本 国 So 寻 Tc，g8] = wavezero(C，SsS，4，'sSym4') 
first-level detaljl 0 国 遇 本 国 风 本 人 
0 and 吉大 2 和 te'that the smoothed Image in Fig.7.8(c) is only slighty blurred, as it was ob- 

tnrou 让 相 - 

和 SI resujts 
after zeToling the 
second-,third-， et De 
and fourth-level 和 5 Ding the second level detall coefficients as well, The coefficient zeroinpg 
detaijls. ON 的 ess COntinues in Fig, 7.8(e), where the third level of details is zeroed, and 


Wavelet-based 


We 


Am 昌  Sfa 四 


in the wavelet domain and frequency in the Fourier domain. 串 


nsider next the transmission and reconstruction of the four-scale wavelet EXAMPLE 7.10: 
Orm in Fig. 7.9(a) within the context of browsing a remote image data- ”Progressive 

or a Specific image. Here, we deyiate from the three-step procedure de- ITeconstruction. 
d at the beginning of this section and consider an application withonut a 
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f = Wavecopy( a ，C，S); 多 Approximation 1 
figurej imshow(mat2gray(f) ) 


[C，S] = Waveback(Cc，S， jpeg9.7 ，1) ; % Approximation 2 
f = Wavecopy( 'a' ，Cc，S) | 

figureji imshow(mat2gray(f) ) ; 

[c，s] = Waveback(Cc，S， jpeg9.7 ，1 ) ; %% Approximation 3 
f = WaveCcopy( 'a ，C，S) |; 

figure; imshow(mat2gray(f) ) ; 

[C，S] = Waveback(Cc，S， jpeg9.7' ，1) 务 Approximation 4 
= Wavecopy( a'，Cc，S); 

igurej;i imshow(mat2gray(f) ) ; 

Cs] = Waveback(Cc，S， jpeg9.7' ，1); %5 Final image 

= Wavecopy( a ，C，S); 

igure; imshow(mat2gray (了 f) ) ; 





ite that the final tour approximations Use waveback to perform single level 
Onstructions. 二 





cause the Image Processing Toolbox does not include routines for computing or using 
jet transforms, a significant portion of this chapter is devoted to the development 
et of functions that extend the Image Processing Toolbox to wavelet-based imag- 
Jhe functions developed were designed to be fully compatible with MATLABS 
jet TIoolbox, which is introduced in this chapter but is not a part of the Image 
cessing Joolbox. In the next chapter, waveiets will be used for image compression, an 
area in Which they have received considerable attention in the literature. 





FIGURE 7.9 Progressive reconstruction: (a) A fourscale wavelet transformi; (b) the fourtk 
level approximation image from the upper-left corner: (cj a refined approximation incof 
porating the fourth-level details; (d) through (f) further resolution improvemen 
Incorporating higherlevel details. 





Increasing resolution. This progressive reconstruction PTrocess is easily simujat 
ed using the following MATLAB command Sequence: 


>> = imnread('Strawberries.tif'); 先 Generate transfor 
>> [fc，S] = Wavefast(f，4，'jpeg9.7')， 
>> Wave2gray(Cc，S，8); 
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ograms as 包 ough they were conventional Mr-files or built-in functions， 
onstrates that MATILAB can be an eftfective tool for prototyping Image 
ImPIesSion Systems and algorithms. 


| 峡 Background 


s can be Seen in Fig. 8.1, image compression Systems are composed of two 
tinct Structural blocks: an emrcoder and a decoder. Image 丰 xz, yj is fed into 
he encoder, which creates a Set of symbols from the input data and uses them 
Tepresent the Image. If we let 7 and 712 denote the number of information 
arIying units (usually bits) in the original and encoded images, respectively， 
the_ compression that is achieved can be quantified numerically via the 
HiD1eSSi1O7 7G8O 


好 2 


ata set. In MATLAB,the ratio 本 the nunmber of blts used In the represen- 
ation of two image files and/or variables can be computed with the follow- 
ng M-function: 


PremT1ez0 


Jmage compression addresses the problem of reducing the amount of datar 
qulred to represent a digital image. Compression is achieved by the removalo 











one Or more of three basic data redu1ada1tcies: (1) codznag reduradarcy, whichis unction cr = imratio(f1，f2) imratio 
Present when less than optimal (ie.,the smallest length) code words are used TIMRATIO Computes the ratio of the bytes in two images/variab1es， ee 
(2) iniempixel reduadacy, which 二 fom corTelations between the Pixels CR = IMRATIO(F1，F2) returns the ratio of the number 0f bytes in 

of an image; and/or (3) psycpovisual redundanrcy, which is due to data that 半 variables/fiIlLes Fl1 and F2. If F1 and F2 are an original and 

ignored by the human visual system (ie.,visually nonessential information). 于 compressed jmage，respectlvely，CR 18 the compresslon rati0， 

thlis chapter,， we examine each of these Tedundancies, describe a few of the rror(nargchk(2，2，nargin) ) ; %% Check input argumentSs 

many techniques that can be used to exploit them, and examine two important = bytes(f1) / bytes(T2) ; 和 Compute the ratio 

compression standards- 一 JPEG and JPEG 2000. These standards unify th 和 

concepts Introduced earlier in the chapter by combining techniques that co Unction b = bytes{(Tf) 

jectively attack all three data redundancies. -Return the number of bytes in input f, If 下 is a string，asSume 

Becatuse the Image Frocessing Toolbox does not include functions 诺 :that it is an image fiLlename，jif not，it is an image variab1le， 

Inage compresslon,a major goal of this chapter is to provide practical ways 0 

exploring compression techniques within the context of MATLAB. For i 

Stance, we develjop a MAILAB callable C function that illustrates how to ma: 5 | ee 
nipulate Varlable-length data TePIesentations at the bit level, Thls 1s important 人 人 加 0 
because variable-length coding is a mainstay of image _ compression， but System blocK 
MAILAB is best at processing matrices of uniform (i.e., fixed length) data; JR ComprIessed diagram. 
During the development of the function， we assume that the reader has 8 Image Ra 
working knowledge of the C language and focus our discussion on how to 

make MAILAB interact with programs (both C and Fortran) external to the 

MAILAB environment. This is an imnportant skill when there is a need to 问 

terface M-functions to Preexisting C or Fortran Programs, and when vector- Symbol Inverse 六 


decoder mapper 


zed Mr-functions still need to be speeded up (e.g., when a for loop can not be 
adequately vectorized). In the end, the range of compression functions deveL 
oped lin this chapter, together with MATLAB'S ability to treat C and Fortrag 


se。 刘 M。 和 art。 Ar 一 


Decoder 
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if ischar(f) 
info = dir(f); b = info.bytes， 
elSeif isstruct(f) 
5 MATLAB 's whos function reports an extra 124 bytes of memory : 
#% per Structure field because of the way MATLAB stores 
% Structures in memory。 Don't count this extra memory， instead 
# add Up the memory associated with each field， 
b = 0; 
fields = fieldnames(f) ; 
for k = 1:Jength(fields) 
b = b + bytes(f.(fields{fk})); 


1 loy the dynamilc Structure fieldname syntax to set and/or get the contents 
tructure field F, respectively 

TD view and/or use a compressed (ie.,encoded) image， it must be fed into a 
oder (See Flg. 8.1, where areconstructed output image, (xz, y), is generated. 
eneral， 六 (xx 》) may or may not be an exact representation of F(x， y). Ifit is， 
system 1 calied error 广 ee 了 儿 rzrzatiomr preserving, or 1ossjess: 让 not some 
1 of distortion is Present in the reconstructed Image. In the latter case, which 
called /ossy CO71IP7essiom, We can define the error e(x, y) between (zx, y) and 
y),forany value ofxz and y as 





end e(z,y) = jx,y) 一 (xz,y) 
el1Se . 
info = Whos('f')， b = info.bytes; at the total error between the two Images 1S 
end M-1 N-l 、 
> 六 [Fx, y) ss 三 切 )j 
X=0 yy 一 


For example, the compression of the JPEG encoded image in Fig. 2.4(c) 区 
Chapter 2 can be computed via 了 be 713 (rooft-71aea11-89LG7e) error emms between F(x， y) and F(xz, y) is the 


iare root of the squared error averaged over the M xx N arIITay, Of 


>> 上 二 imratio(imread('bubbles25.jpg')， bubbles25.jpg') 下 
一 | -一 一 攻 ， 过: 尾 ， 
ou | 高 马 总 en- 


35.1612 
efollowing Mr-function computes emms and displays (这 en。 夫 0) both e(X, 7y) 
Note that in function imratio, internal function b = bytes(f) is designed nd lts histogram, Since e(x, y) can contain both positive and negative values 
t Iather than imhist (which handles only image data) is used to generate 
nonstructure variable. 开 fi a nonstructure variable ,function whos intr e histogram. 


dir performs a similar service. In the Syntax employed, dir returns a structure flnction rmse = compare(f1 ，f2，Scale) compare 
0MPARE Computes and disp]lays the error between two matrices，. ee 
RMSE = COMPARE(F1，F2，SCALE) returns the root -mean-Square error 

between inputs F1 and F2， displays a histogram of the difference， 

and displays a Scaled difference image，When SCALE is omitted，a 

Scale factor of 1 is used， 





Check input arguments and set defaults， 
rortnargchk(2，3，nargin) ) ; 
nargin < 3 


友人 
和 





二 允 作 癌 dnames fieldnames is used to Ietrleve alist of the fields in f, and the statements 


SCale = 1 ; 
names = for Kk = 1:Jength(fields) 
fieldnames(s) re- b = b + bytes(f， (fields{fk}) ); Compute the oot -mean-Square erFor. 


Iz73 Q Cell array OF 
S1711183 COHMIGiig 


= double(f1) - doublef(f2)， 


je Structure Fieldl Perftorm the recursions. Note the use of di1G11Nic StrUcture jieladzaraeyin there 2 间 辣 :) 2) 1 (mx n))， 
cUISlVe calls to bytes.IfSis astructure andFisa string variable containing a 2 羽生 


Output error image & histogram if an error (i,e,.，rmse -= 0) . 
f rmse 

:多 Form error histogram， 

emax = max(abs(e(:)))， 

[h，x] = hist(e(:)，emax); 


fbeld name, the statements 


下 
亡 
人 中 
盖 
心 
| 
< 
aa 
| 
re 
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If Jength(h) >= | 


Tigurej bar(X，h，'K2)， (mo 1 





Code 2 





Codef (nm 


名 Scale the error image Symmetrically and display 
emax = emax / SCale; 
e = mat<cgray(e，[-emax，emax] ) ， 
figure; imshow(e) ; 
end 


end 
js, Zavg 三 丰 when7alssubstituted for 太 和 ), TIhen the constant ra may be 


outside 也 e summation，leaving only the sum of the Dr(rm) for 
大 三 也 which,of course,equajls 1.Asis lustrated in Table 8.1,coding re- 
Undancy is almost always present when the gray levels of an jimage are 
gded Using a natural binary code. Im the table, both a fixed and variable- 
gth encoding of a four-level image whose gray-level distribution is shown 

lumn 2 is given. The 2-bit binary encoding (Code 1) in column 3 has an 
age length of 2 bits. TIhe average nuUmber of bits requjred by Code 2 (in 


mn 5) js 


Finally， we note that the encoder of Fig. 8.1 is responsible for reducinpg 村 
coding， Interpixel, and/or psychovisual redundancies of the linput image. In 节 
first stage of the encoding process,the 7124PPer transforms the input image init 
a (usually nonvisual) format deslgned to reduce interpixejl redundancies 了 
Second Stage, OF 9UC1EZer block, reduces the accuracy of the mapper's out 
In accordance with a predefined fidejlity criterion 一 attempting to elimi 
onjly psychovisually redundant data.This operation ls iTrreversible and mu 
omltted when errorfree compression is desired. In the third and fina] Stage 
the process, a yy1abpol coder creates a code (that reduces coding redundan 


tor the quantizer output and maps the output in accordance with the code 一 六 P(K)P(re) 
| avg 7 


如 
[| 
广 。 
避 
己 
忆 
ee 
人 
六 
SS 
SS 
QQ 
“ 
《D 
(内 
C 
(7 
一 
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《> 
JR 
0 
已 
《D 
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1 
(LDO 入 


RE 
2 
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Let the discrete random variable 六 forK = 三 1 2,... ,了 with associated Prob 

bilities Pr( 六 ) represent the gray jlevels of an 世 -gray-level image， As 

,mcorresponds to gray level0 (since MATLAB arTay indlices can 
euUjand 





Coding Redundancy 


2 njorrzatiomr teory Provides the mathematical framework to answer this 
related questions. fts fundamental premise is that the generation of infor- 
ion can be modeled as a probabilistic process that can be measured in 3 
ner that agrees with intuition. In accordance with this supposition, a ran- 
m event 忆 with Probability 已 ( 达 ) is said to contain 





1 

7 兢 ) = log FE) 一 log 已 ( 歼 ) 
Units of information. If P( 已 ) = 1 (that is the event always occurs), 7( 刁 ) = 0 
and no information is attributed to it. That is, because no Uncertainty is associ- 
ated with the event, no information would be transferred by communicating 

hat the event has occurred. Given a source of random events from the dis- 
ete Set of possible events {al,az，. ,01J} with associated probabilities 
(ai), P(az)，P(aj)}, the average information per Source output, called 
eePtropy of the source, 1S 


total number of bits required to code an M X N Image is MANLavg. 
这 
殖 = 一 之 Paij log 已 (ai 


When the gray levels of an image are represented using a natural mm-bit Di 
Dary code, the right-hand side of the preceding equation reduces to 117 bits 
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TABLE 8.1 
jlustration of 
coding redundancy: 
Lawe = 人 2ftor 

Code 1; Zav = 1.81 
for Code 2. 


8.2 训 Coding Redundancy ”289 































288 Chapfer 8 本 Image Compression 


全 119 ”168 168 
119 119 107 119 
0o7 107 119 119 


hist(f(:)， 8) ; 
p / Sumtp) 


于 an image is interpreted as asample of a gray-jevel source” that emitte 
we can model that source's Symbol probabilities using the gray-leve R 
togram of the observed image and generate an estimate, called the 户 rsr-or 
est1141e, 杂 , of the Source's entropy: 


瓦 = 一 之 (m)log 忆 (rr 
We 0.1875 0.5 0.125 0 0 0 0 0.18z5 
5uch an estimate is computed by the following M-function and, under the 起 
Sumption that each gray level is coded Independently, is a lower boun 
the compression that can be achieved through the removal of coding redi 
dancy alone，. 2 


de 2of Jable 8.1, with Zavs 汪 1.81, approaches this first-order entropy esti- 


eand is a minimal length pizary code for image f.Note that gray level 107 
sponds to 六 and corresponding binary codeword 01 1 in Iable 8.1, 119 


entropy function h = entropy(X， mn ) 
#ENTROPY Computes a first-order estimate of the entropy of 8 matrix 





H= ENTROPY(X，N) returns the first-order estimate of matrix 
5 With N symbols (N = 256 if omitted) in bits/symbol，The estinat erresponds to 疡 and code ]2, and 123 and 168 correspond to 010， and 00，， 
5 asSumes 3a statistically independent source characterized by the SpectiVejly. 府 
relative frequency of occurrence of the elementSs in X， ， 
error(nargchk(1，2，nargin) ) ; 多 Check input arguments 21 Huffman Codes 
if nargin < 2 

n = 256; % Default for mn hen coding the gray levels of an image or the output of a gray-level mapping 
end eration (Pixel differences, run-lengths, and so on), 五 xj 肪 zar codey contain 
x = double(x)， % Make input double e:Smajjest possible number of code Symbols (e.g., bits) per source Symbol 
xh = hist(x(:)，n)， % Compute N-bin histogranm gg， gray-level Value) Subject to the constraint that the source Symbols are 
xh = xh / sum(xh(f:) ); % Compute probabilities do71e Qt 111e， 

人 要 Jhe first step ip Hufftman's approach is to create a series of source reduc- 
a48 nask t0 eliminate 0 S since 10g2(0) = -inf， by ordering the probabilities of the symbols under consideration and 


G8 
ii 之 


find(xh) ， 


j bining the lowest probability symbols into a Single Symbol that replaces 
h = -Sum(xh(iz) .* 10g2(xh(i))); % Compute entropy 


Im in the next source Teduction. Figure 8.2(aj ilustrates the process for the 
ay-level distribution in Table 8.1.Atthe far left the initial set of source Sym- 
8 and their probabijlities are ordered fronm top to bottom in terms of de- 
easing probability values. To form the first source reduction ,the bottom two 
Tobabilities, 0.125 and 0.187$, are combined to form a “compound symbol” 
ith Probability 0.3125. This compound symbol and its associated probability 
e placed in the first source reduction column so that the probabilities of the 
uced source are also ordered from the most to the least probable. This 
Drocess is then repeated until a reduced source with two syInbojls (at the far 


Note the use of the MATLAB find function which is empjoyed to determ 
the indices of the nonzero elements of histogram xh.The statement find(x 
equlvalent to find(Xx ~= 0). 上 unction entropy uses find to create a vecto 
of indices, ji, into histogram xh, which is Subsequentiy employed to elimin 
all zero-valued elements from the entropy computation in the final statement 
革 this were not done,the J0g2 function would force outputh toNaN (0* 一 :int 
1S of da MUHtper) when any symbol probability was 0， 


也 XAMDPLE 8.1: 喷 Consider a simple 4 X 4 image whose histogram (see p in the following 

frst- code) models the symbol probabilities in Table 8.1. The following command 

jine sequence generates one such image and computes a first-order estimate 0 
Its entropy. 


_Jhe second step in Hufftmans procedure is to code each reduced Source， 
ting with the smallest source and working back to the original source. The 
minimal length binary code for a two-symbol source, of course, consists of the 
Symbols 0 and 1. As Fig. 8.2(b) shows, these Symbols are assigned to the two 
symbols on the right (the assignment is arbltrary; feversing the order of the 0 


>> ff = [119 123 168 1419; 123 119 168 168 ] ; 
[fi 119 119 107 119; 107 107 119 119] 


and 1 would work just as welD).As the reduced source symbol with probability 
05 was generated by combining two symbols in the reduced source to its Jeft， 
the 0 used to code it is now asSigned to bo 态 of these symbols and a 0 and 1 


are arbitrarily appended to each to distinguish them from each other This 


WV 
V 
- 直 


119 “123 168 119 
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FIGURE 8.2 

了 uftman (3a) 
source reduction 
and (bj code 
asSignIment 


PIOcedures. 


NU 和 fman 
5 










OFresSponding index of P. 


Original Source Source Reduction 


Symbol Probabjility 1 2 








ased on huffman5 by Sean Danaher，University 0f NorthumbPria， 


jewcast1e UK. Available at the MATLAB Central File Exchange: 
ategory General DSP in Signal Processing and Communicat1ions . 









0.5 0.5 0.5 
ad 0.1875 0.3125 0.5 
al 0.1875 村 0.1875 时 


0.123 











heck the input arguments for reasonab1leness ， 
-nargchk(1，1，nargin) ) ; ， 
dims(p) ~= 2) | (min(size(p)) > 1) | -isreal(p) | -isnumeric(p) 
for('P must be a real numeric Vector，); 













Original Source Sotrce Reduction 





celLl(length(p)，1) % Init the glLobal cell array 


(p) > 1 % When more than one Symbol ,,， 


Symbol 








05 1 05 1 



































he input probabilities 
as 0.1875 00 0.3125 01 05 0 pP / Sum(p) % NormalIize 
0.1875 We = Preduce(p) 务 DO Huffman SouUrce Symbol reductions 
iakKecode(Ss，[]); 点 Recursjvely generate the Code 
OpE 人 % Else，trivial one Symbol casel 


operation is then Tepeated for each reduced source until the OTlginal Sour 
Teached. IJhe final code appears at the far left (column 3) in Fig. 8.2(b)， 

Jhe Huttman code in Fig.8.2(b) (and Table 8.1) is an instantaneous UniqU 
ly decodable block code. It is a biock code because each source Symb< 
mapped into a fixed sequence of code symbols, It is 访 siamatameors becal 
each code word in a string of code symbols can be decoded withont referer 
ing Succeeding symbols. That is, in any given Hufftman code, no code wordi 
Prefix of any other code word.And it is xzaiguely decodapie becausea String 
code Symbols can be decoded in only one way.Thus, any string of Huffman 
coded symbols can be decoded by examining the individual Symbols of 
String in a left-to-right manner. Forthe4X4 Image in Example 8.1, a top-t 
bottom left-to-right encoding based on the Huffman code in Fig. 8.2(b) Vi 
the 29-bit string 10101011010110110000011110011. Because we are using 
instantaneous uniquely decodable block code, there is no need to insert de 
iters between the encoded pixels.A left-to-right scan of the resulting stringr 
veajs that the first valid code word is 1, which is the code for Symbol cz OF8g 
level 119. The next valid code word js 010, which corresponds to gray jevel 1< 志 
Continuing in this manner, we eventuajlly obtain a compjletely decoded ima 儿 
that is equivalent to f in the example, 

The source reduction and code assignment procedures just described 
Implemented by the following M-function, which we call huffman: 0 % 


tion s = reduce(p); 

ate a Huffman Source reduction tree in a MATLAB Cell Structure 
performing Source Symbol reductions untiIl there are OnJIyY 二 WO 
Uced Symbols remalning 


cel1(length(p)， 1 ) ; 


eenerate aa Starting tree With Symbol nodes 1，2，3，..， to 
ference the symbol probabilities， 

奔 = 1:1ength(p) 

{ = 


ile numel(S) > 2 
p，i] = Sort(py); % Sort the Symbol probabilit1eSs 
1 p(2); 5%5 Merge the 2 lowest probabilLities 
] ; % and prune the Lowest one 
和 5 


S=S(i); Reorder tree for new probabilities 
;s{2} = {s{1}，s{f2}}， sand merge & prune its nodes 
[ 


a 
sS(1) = % to match the probabilitzies 


can the nodes of a Huffman Source reduction tree Fecursively to 
enerate the indicated variable Length code Words， 


function CODE = huffman(p) 

<HUFFMAN Builds a variable-lLlength Huffman code for a Symbol Source 
和 CODE = HUFFMAN(P) returns a Huffman code asS binary Strings in 
5 Cell array CODE for input Symbol probability vector P，Each word 
各 in CODE corresponds to a Symbol whose probability is at the 





1obal variable surviving all recursSive Cal1sS 
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deGriyvalenl ex- 
PressiO 8 X = 
cell([m,n]j). or 
oOier forrzy 由 pe 
>> help cel1. 

























ty matrix. That is, X{1} refers to the contents of the first element (an 
y) of X;X(1) refers to the element itself (rather than its content), Since cej 
ys can be nested within other cel arrays, the Syntax X{1}{2} refers to the 


if isa(SC， 'CelLl') 5 For cell array nodes， 
makecode(scf1}，[codeword 0T); % add a 0 ithe 1St elemen 
makecode(scf2}，[codeword 1])， %oralifthe 2nd 
e1Se 5 For jleaf (numeric) nodes 
CODE{Sc} = char('0' + codeword)ji 点 create a char code String: 败 


end fter CODE is initialized and the input probability vector is normalized 


. 计 p=py/sSsSum(p) statementj,the Hufftman code for normalized proba- 
command line Sequence uses huffman to generate the cod ity vector p js created in two steps. The first step, which is initiated by the 
ig. 8.2: 


>> P= [0.1875 0.5 0.125 0.1875] ; 8.2(a). In reduce, the elements of an initially empty source reduction 


>> C = huffman(Pp) array S， Which is sized to match CODE, are initialized to their indices. 
性 一 

OOT: 

1 1 

0D10， 

00 上 


Note that the output is a variable-length character arTay in which each row 
String of 0s and 1s 一 the binary code of the coITespondingly indexed sympbo 
p. For example，'010' (at array index 3) is the code for the gray level v 
Probability 0.125. 


ere output y 1s the sorted elements of x and index vector 1i is such that 
X(i). When p has been sorted, the lowest two probabilities are merged by 
1 jacing thelr composite probability in p(2),andpf(1) is Pruned.TIhe Source re- 

了 弛 e opening lines of huffman, input argument p (the InPut Symbol prot tion cell array is then reordered to match p based on index vector iusing s 
abijity vector of the symbols to be encoded) is checked for reasonableness 证 让 .Finally, s{2} is replaced with a two-element cell array containing the 
Siopa/ 7C7IQDLe CODE 1S initialized as a MAILAB cell array (defined Therged probability indices via s{2} = {s{f1}，s{21} (an example of content 
Section “2.10.6) with Ilength(p) rows anda single column,AllMATLAB 吕 f dl ing), and cell indexing is employed to prune the first of the two ImeTged 
al variables must be decjlared in the functions that reference them usin ments, s(1),vias(1) =[].The process is repeated until only two elements 
Statement of the form aa 


gurIe 8.3 shows the final output of the process for the symbol probabilities 


9jobal X Y 2 able 8.1 and Fig.8.2(a). Figures 8.3(b) and (c) were generated by inserting 


This statement makes variables X, Y, and Z available to the function in whidhl 
they are declared, When several functions declare the Same global variab 
they share a single copy of that variable. In huffman, the main routine and i 
ternal function makecode share global variable CODE. Note that it is customa 
to capitallize the names of global variables. Nonglobal variables are 1ocal wa 
ables and are available only to the functions in which they are defined (not 
other functions or the base workspace)jithey are typically denoted in lowercas 

In huffman,CODE is initialized Using the cell function, whose Syntax 1S 


Celldispl(s); 
CellLplot(s); 


S{1}T{T = 4 
s{1}j{f2jf1} =3 
s{t1}j{f2}{f2} =1 
Ss{2} = 2 


X = Cell(m，n) 







It creates an 117 X 姑 array of einpty matrices that can be referenced by cell of 
by content. Parentheses“()”are used for ce1 idexin8g; cUIIy braces“{}” 9 
Used for comtert ipdexinmg. Thus， Xx(1) = [] indexes and removes element: 
from the cell array while xf{t1} = [] sets the first cel array element to 


ER 







AAA、 
ER 
站 

二 关 人 人 和 
SR 


和 
| 这 和 





| 

! 

0.125 0.1875 
| 





一 一 -G4 3 | 台 ? 
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ED ec: 

FIGURE 8.3 
SoOUrce reductions 
of Fig.8.2(a) using 
function huffman: 
(aj binary tree 
equlvalent; 

(b) display 
generated by 
Cellplot(s); 

(c) cell1disp(s) 
output. 
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between the last two statements of the huffman main routine.MATLAB fune 
tion celldisp Prints a cell array's contents recursively; function cel1lpl 
Produces a graphical depiction of a cell array as nested boxes.Note the one- 
one corTespondence between the cell array elements in Fig. 8.3(b) and 
SOurce Iceduction tree nodes in Fig.8.3(a)j: (1) each two-way branch in the 全 
(which represents a Source reduction) corresponds to a two-element cejl 3 
in s; and (2) each two-element cell array contains the indices of the sym 
that were merged in the corresponding source reduction. For exXample， 
merging of Symbols ai and al at the bottom of the tree Produces the { 
element cell arTray s{1}{2}, where sfijf2}{t1y=3andsf1}{f2}{2} = | ( 刀 
indices of Symbol as and al, respectively).The root of the tree is the top-leve 
two-element cell array S. 

The final step of the code generation Process (i.e., the assignment of co 
based on Source reduction cell array s) is triggered by the final statement ， 
huffman 一 the makecode(s，[]) call. This call initiates a recrryive code 3 
Signment process based on the procedure in Fig, 8.2(b)， 人 Although recursi 
generally provides no Savings in storage (since a stack of values beiie 
processed must be maintained Somewhere) or increase in Speed,it has the at 
vantage that the code is more compact and often easier to understand， par 
ularly when dealing with recursively defined data structures jike trees A 
MATLAB function can be used recurslvejlyi; that ls, it can call itself either d 
rectiy or indirectiy When recursion is used, each function call generates a f 
Set of local variables, independent of all previous sets, 1 

internal function makecode accepts two inputs: codeword, an array of 0 
and ls, and sc, a Source reduction cell array element. When sc is itself a ce 
arTay, lt contains the two source Symbols (or composite symbols) that wer _ 
Joined during the source reduction process, Since they must be individuall 


coded,apair ofrecursive calls (to makecode)j is issued for the elements 一 alon 2 
with two appropriately updated code words (a 0 and l are appended to inpu 2 
codeword). When sc does not contain a cell array,it is the index of an origina 1 
source Symbol and is assigned a binary string created 他 om input codewor 2 
Using CODE{Sc} = char('0: + codeword), As was noted in Section 2.10.5 Ka 
MAILAB function char converts an array containing positive Integers tba 


Tepresent character codes into a MATLAB character array (the first 127 code 
are ACID) Ihus, for example, char('0' + [010]) produces the characte 
String '010 ,Since addinga0totheASCIH code fora0yieids an ASCII '0， 
whlle adding a l to an ASCII '0' yields the ASCII code for a 1 namely 二 

Table 8.2 details the sequence of makecode cajlls that results for the sourc 
reduction cell array in Fig. 8.3. Seven calls are required to encode the fou 
symbojis of the Source. The first call (row 1 of Table 8.2) is made from the main 
Ioutine of huffman and lannches the encoding process with inputs codeword 
and sc set to the empty matrix and cell array s, respectively In accordanc 
with standard MATLAB notation, {1x2 ceL1} denotes a cell arTay with one : 
IOw and two columns. Since Sc is almost always a cell array on the first cal 
(the exception is a single symbol source), two recursive cajls (See rowSs2 and7 
of the table) are issued.The first of these calls initiates two more calls (TOWS 3 
and 4) and the second of these inijtiates two additional calls (rows 5 and 6) 








011 
才 半 

010， 
00 


main routine 


makecode 
makecode 
makecode 
makecode 
makecode 
makecode 


4 2 

4 4 

1 2 

2 2 
Si1ze Bytes 
4X4 16 


{1Xx2 cell} 

【2j 

[4] {11x2 Cel11}+ 
4 

[3j [1 


C1aSsS 


Uint8 array 


rand total is 16 elLlementS USing 16 bytesS 


>C = huffmanthist(double(f2(:))，4)) 


8.2 筑 Coding Redqundancy “295 


codeword 





ach pixel in f2 is an 8-blt byte; 16 bytes are used to represent the entlre 
mage. Because the gray levels of f2 are not equiprobabjle, a varlable-iength 
ode (as was indicated in the last section) will reduce the amount of memory 
equired to represent the image. Function huffman computes one Such code: 


TABLE 8.2 

Code asSignment 
Process for the 
SoOurce reduction 
celi array in 
Fig.8.3， 


EXAMPELE 8.2: 
Varlabje-length 
code mapplngs in 
MATLAB. 
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ie, cell arTray h1f2 ls transformed into a 3 X 16 character array, h2f2. 下 ach 
mn of h2f2 corresponds to a pixel of f2 in a top-to-bottom left-to-right 
columnwise) scan. Note that blanks are inserted to size the aITay PITOpPer- 
and, since two bytes are required for each '0' or '1' of acode word, the 
引 memory used by h2f2 is 96 bytes 一 still six times greaterT than the original 
bytes needed for f2. We can eliminate the inserted blanks using 


Since Huftman codes are based on the relative frequency of occurrence of 证 
SOuUrce Symbols being coded (not the symbols themselves), c is identical to 二 
code that was constructed for the image in Example 8.1. In fact, image f2 ， 
be obtained from f in Example 8.1 by mapping gray levels 107, 119， 123， 
108 to 1, 2, 3, and 4 respectively, For either image,p = [0.1875 0.5 0.1i 
0.1875]. 

AsSsimple way to encode f2 based on code cjsto perftorm a straightforw: 
lookup operation: 


2f2 = h2f21(:) | 


hh2f2(n212 = 二] 
时 Whos( 'h2f2 ' ) 
ee ”Name Size Bytes Class 


h1Tf2 = 
h2f2 29X1 58 char array 
Columns 1 through 9 
了 nd total 1S 29 elements USing 58 bytes 
由 010. 术 0141 010， 国 的 011 00: 
CoJLumns 10 through 16 the required memorty is still greater than f2'S original 16 bytes. 
00 0 00 ， 9 compress f2, code c must be applied at the bit ljevel with several encod- 
ixels packed into a Single byte: 
>> Whos( 'h1if2 ' ) ECAS 8 7 
Name Size BytesSs CJLassS 
h3f2 = mat2huff(f2) 
htf2 1Xx16 1S30 Cell array 


Grand total is 45 elements using 1530 bytes size: [4 4] 
min: 32769 

hist: [3 8 2 3] 

Code: [43867 1944] 

“Whos( 'h3f2 ) 

Name Size Byt+tes CILIaSS 


h3f2 1X1 518 struct array 


Here，f2 (a two-dimensional aITay of class UINT8) is transformed into 
1 X 16 cell array, h1f2 (the transpose compacts the display). The elements 
h1f2 are Strings of varying length and SOITesSpond to the pixels of f2 in atop- 
bottom left-to-right (i.e.， columnwise) scan. As can be seen the encoded imag 
uses 1530 bytes of storage 一 -almost 100 times the memory Tequired by f2! 

Jhe use of a cell array for h1f2 is logical because it is one of two standar 
MAILAB data structures (see Section 2.T0.6) for dealing with arrays of dis 6rand total is 13 elements Using 518 bytes 


Although function mat2huff returns a structure, h3f2， ITequiring 518 bytes of 
lemory, most of it 18 associated with either (1) structure variable overhead 
Iecall from the Section 8.1 discussion of imratio that MATLAB uses 124 
ytes of overhead per structure field) or (2) mat2huff generated information 
0 facilitate future decoding. Neglecting this overhead，which is negligibjle 
hen consldering Practical (ie.,normal size) images,mat2huff compresses f2 
ya factor of 4:1. The 16 8-bit pixels of f2 are compressed into two 16-bit 








transforming hf1f2 into a conventional two-dmensional character array': 


>> h2f2 = chan(hif2)， 


h2f2 = 
1010011000011011 ords 一 the elements in field code of h3f2: 
11+ 1001 0 
ee > hcode = h3f2.code， 
>> Whos( 'h2f2 ' ) > Whos( 'hcode ' ) 
Name S1Lze Bytes C1LasS Name Size Bytes CJLass 
h2f2 3x16 96 char array hcode 1X2 4 uUint16 array 


Grand total is 48 elements using 96 bytes Grand total is 2 elements using 4 bytes 
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2 decabin 


COPPeIG G cecipyat 
PIHESer iod DGAY 
317008. ZEOr OAe 天 - 
Jorpialiom ipe 

>> helpdec2bin. 


iatr2hufT 和 ff 
RE 


>> dec2bin(doubLlelhcode ) ) 


引 nS 三 
1010101101011011 
0000011110011000 


Note that dec2bin has been empioyed to dispjay the individuajl bits 
h3f2.code.Neglecting the terminating modulo-16 pad bits (ie., the finalth 


Us)，the 32-bit encoding is equivalent to the previousiy generated 


5ection 8.2.1) 29-bit instantaneous uniquely decodable block cod 


10101011010110110000011110011. 


As was noted in the preceding example, function mat2huff embeds thei 
formation needed to decode an encoded input array (e.g its Ofiginal dime 
Silons and Symbol probabilities) in a single MATLAB structure variable Th 
information in this structure is documented in the help text section 


mat2huff itself: 


function yY = mat2huff(X) 
%MAT2HUFF HuTfman encodes a matrix， 
Y = MAT2HUFF(X) Huffman encodes matrix X using Sympol 
probabilities in unit-width histogram bins between X's mininmun 
and maximum valueSs，The encoded data is returned as a structure 
站 
Y.code The Huffman-encoded values of X，stored in 
a Uint16 vector，The other fields of Y contain 
additional decoding information，including: 
Y,man The minimum Value of X plus 32768 
Y,Slze The Size of X 
Y,hist The histogram of X 


If X is 10gical，uint8，uUint16，uint32，jint8， int16，or doub1le， 
With integer Values， 让 can be input directly to MAT2HUFF，The 
minimum Value of X must be representable as an int16. 


If X is doubje with non-integer values---for example，an image 
with Values between 0 and 1---first Scale X to an appropriate 
integer range before the cal1，For example，Uuse Y = 
MAT2HUFF(255*X) for 256 gray level encoding 


NOTE: The number of Huffman code words 1i5 round(max(Xx(:))) - 
roungtnmin(X(:))) + 1. You may need to Scale input X to generate 
codes of reasonable length. The maximum row or column dimension 
of X is 65535. 


吧 oppocecpocecocec oo oo oo do do do oO op one ceoco de do de 


当 96e al1S0 HUFF2MAT ， 


if ndims(x) -= 2 | -isreal(xX) | (~-isnumeric(x) & ~1SJogical(x)) 
error( X must be a 2-D real numeric or 10gical matrizx，); 
end 



















tore the Size 0f input X， 
jze = Uint32(SlIze(Xx) ) 


ind the range of X values and Store jitSs minimum value biased 
DY +32768 as a UINT16. 

round(double(x) ) ; 

= min(x( ))) 

maxtX(:)) 1 

doublel(Int16(Xxminy) ) ; 

Uint16(pmin + 32768 ) ; ynmin = pmin; 


Compute the input histogram between xmin and Xmax With Unit 
Width bins，scale to UINT16，and store， 

X(:) 

histc(X，Xmin:Xxmax) ; 

ax(h) > 65535 

= 65535 * h / max(h) ; 


- uint16(h); ， yvhist = h; 
gode the input matrix and Store the_ result ， 


= huffmant(doubleth) ) ; 5 Make Huffman Code map 
map(xf(:) ~ xmin + 1) 1 % Wap image 

char(hx) ; %5 COonvert to Char array 
hx(: ) 


emove blankS 





hx = ) = []; 

















e = Cell(]Jength(hx) / 16) ; ompute encoded SI1z6 
= repmat( 0 ，1， re-allocate modul0o-16 vector 
16(1;1Length(hx)) = hx 


esShape to 16-character woprds 


onvert binary StFring to decimal 


和 
) 和 
yYSize wx 16);) 多 
X 和 
= presShape(hx16，16，yYSize ) | 和 5 


R 

C 

P 

Make hx modulo-16 in length 
R 

C 


= hx16' 一 0 
WOS = pow2(15:-13:0) 
code = Ulint16(Sum(hxj16 .* twosS(oneS(yS1ize，1)，:)，2)) 


Note that the statement y = mat2huff(x) Hufftman encodes input matrix X. 
Using unit-width histogram bins between the minimum and maximum Values 
fx, When the encoded data in y.code is later decoded, the Huftman code 
ieeded to decode it must be re-created from y.min,the minimunm value of X， 
nd y ,hist,the histogram of x. Rather than Preserving the 豆 uftman code it- 
li mat2huff keeps the Probability information needed to ITegenerate 1t. 
With this, and the original dimensions of matrix x, which is stored in y.size， 
， function huff2mat of Section 8.2.3 (the next Section) can decode y.code to 


econstruct X. 
The steps involved in the generation of y.code are Summarized as follows: 





Compute the histogram, h, of input x between the minimum and maxi- 
mum values of x_ using unit-width bins and scale it to fit in a UINEI16 


Vector 
Use huffman to create a Hufftman code, called map, based on the scaled 


histogram,h. 
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AH Fnctiom 后 SEE 
trtio hist. or 
1a0re cteiails ype 

>> help histcC. 
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Temoving the coding redundancy associated with its conventional 8-bit bi- 
y encoding, the image has been compressed to about 80% of its Origlinal 
even with the lncluslon of the decoding overhead information)， 

Since the output of mat2huff is a Structure, we write it to disk using the 


e function: 


3 Map input x using map (this creates a cell arTay) and convert it to a cha 
acteI arTay，hx,， Temoving the blanks that are inserted like ip h2f2 浊 
上 Example 8.2. 

4 Construct a _ version of vector hx that arranges 1lts characters into 
character segments. This is done by creating a modulo-16 character Vect 
that w 岂 hold it (hxf16 in the code), copying the elements of hx into if 
reshapingit into a 16 row by ysize array where ySiZze= ceil (length(hx 
/ 16),. Recall from Section 4.2 that the ceil function rounds a numbert 
ward Positive infinity The generalized MATLAB function 





save SqueezeTracy c; 
cr2 = imratio(' Tracy,tif'，'SqueezeTPFacy .mat' ) 
= EUFCEOSYHIGT 

1 .2365 Savefilevanr 
人 StOres WOrKSsPpace 
Variqple var rodEK 


y = reshape(Xx，m，n) 
e Save function, like the Save Workspace As and Save Selection As menu 1 区 





Teturns 30 四 by n matrix whose elements are taken column wise from commands in Section 1.7.4, appends a ,mat extension to the file that is creat- jieecalled 

An error is returned 过 x does not have mxn elements iThe resulting file 一 in this case, SqueezeTracy .mat,is called a 1H47THje.It( Tie'nat 
3 Convert the 16-character elements of hx16 to 16-bit binary numbers (i abinary data file containing workspace variable names and values. Here ,it 

unit16'S). Three statements are substituted for 由 e more compact y ntalins the Single workspace varlable c.Finally we note that the small differ- APhie 


ce In coOmPression Tatios cr1 and cr2 computed previously is due to 


uint16(bin2dec(hx16 ') ). They are the core of bin2dec, which returns 人 th 
TLAB data file overhead. 


ON decimalequivalent of a binary String (e.g.,bin2dec('101' ) returns S) but ar 
faster because of decreased generajlity MATLAB 名 nction pow2(y) is usedt 
TIeturm an array whose elements are 2 raised to the y power. That is, twos: 
pow<(15:; -1: 0) creates the array [32768 16384 8192 .814 21|]. 





$2.3 Huffman Decoding 

ftman encoded images are of little use unjless they can be decoded to re- 
， . te the orlginal images 丰 om which they were derived. For output y = 

上 EXAMPLE 8.3; 泪 To illustrate further the compresslon perftormance of Huffman encoding; iat2huff(Xx) of the previous section, the decoder mnust first compute the 


Encoding with consider the $12 X -bi 
mat2huff， Sbit monochrome ima Juftman code used to encode x (based on its histogram and related informa- 
ion in y) and then inverse map the encoded data (also extracted from y) to re- 


iuild x. As can be Seen in the following listing of function x = huff2mat (y)， 
lis Process Can be broken into five basic steps: 


Extract dimensions m and n, and minimum value xmin (of eventual out- 
put x) from input Structure y， 

Re-create the Huffman code that was used to encode x by passing its his- 
togram to function huffman.The generated code is called map in the listing. 
Build a data structure (transition and output table 1ink) to streamjine the 
decoding of the encoded data in y.code throupgh a series of computation- 


sequence: 


>> 二 = imread(' Tracy.tif' ) ， 
>> C = mat2huff( 下 ) ; 
>> Cr1 = imratio( 下 ， 罗 


crl = 
1.2191 


a bb 

FIGURE 8.4 和 A ally efficient binary searches. 

512 X 512 8-bit Fass the data Structure and the encoded data [ie., Link and y,code] to C 

monochrome function unravel1. This function minimizes the time required to perform 
the binary Searches, creating decoded output vector x of class double. 


Image of a woman 
and a close-up of 
her right eye. 


Add xmin to each element of x and reshape it to match the dimensions of 
the original x (ie.,mIrows and n columns)， 


A unique feature of huff2mat is the Incorporation of MATLAB callable C 
junction unrave1l (see Step 4), which makes the decoding of most normal res- 


ution images neariy instantaneous. 

function x = huff2mat(y) huff2mat 
UFF2MAT decodes a Huffman encoded matrix. We 
当 X = HUFF2MAT(Y) decodes a Huffman encoded structure Y with Uint16 


fl1el1ds: 
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Y ,mnn Minimum Value of X plus 32768 
Y,S1Zze Size of X 

Y.hist Histogram of X 

Y.code HuUffman code 


canned Huffman encoded String 一 which must of course be a '0' or a 1 ' 一 
iggerS 3 binary decoding dqecision based on transition and output table Link. 
ie construction of link begins with its initiallization in statement Link = [2; 
0]. Each element in the starting three-state Jink array COTresponds to a 
ufftman encoded binary string in the corresponding cell array codel initially， 
0de=Cellstr(char( ，0'，1)).Ihenullstrng,code(1),isthestarting 
Dint (or initial decoding state) for all Hufftman string decoding.The associated 
训 Link(1) identifies the two possible decoding states that follow from ap- 
endinga '0' and '1' tothenullstrng. 开 the next encountered Hufftman en- 
ded bit is a :0 ,the next decoding state js Link(2) [Since code(2) = '0',the 
ull string concatenated with '0'];ifitis al',the newstateis Link(3) [at 
dex (2 + 1) or 3, with code(3) = 1]. Note that the corresponding Link 
Tay entIles are 0 一 indicating that they have not yet been processed to reflect 
e Proper decisions for 再 ufftman code map. During the construction of Link, 让 
ther string (ie.,the '0' or '1')isfoundin map (ie.,itis avalid Hufftman code 
ordj, the corresponding 0 in 1ink is replaced by the negative of the corre- 
onding map index (which is the decoded vaiue). Otherwise, a new (positive 
lued) Link index is inserted to point to the two new states (Possible Huftman 
e Words) that logically follow (either '00' and '01' of '10' and “117)， 
Senew and as yet unprocessed Link elements expand the Size of Link (cel 
Tray Code must also be updatedj, and the construction Process 1S8 continued 


The output X is of class doub1le， 


9 oo oo oo oo do de 


See al1S0 MAT2HUFF ， 


if ~isstruct{(y) | ~isfield(y，'"min') | -isfield(y，'Size') | ,.， 
~ISfield(y， hist') | ~isfield(y，'code ') 
errort The input must be a Structure as Feturned by MAT2HUFF . ) ; 
end 


sz = double(y.Ssize); mm=sSz(1);， mn= SZz(2); 
xmln = doubJlel(y,nmin) - 32768; 生 Get X 和 印 inimum 
map = huffman(doub1le(y.hist) ) ， 点 Get Huffman code (cel1) 


5 reate a binary Search table for the Huffman decoding process 
名 “ Code” contains Source Symbol Strings Corresponding to "1Link' 

5 nodes，wWhile "link' contains the addresses (+) to node pairs for 
% node Symbol Strings plus '0' and :1 or addresses (-) to0 decoded 
二 Huffman codewords in 'map'， Array '1left' is a list of nodes yet to 
各 be processed for '1Link' entries， 


cogde CELLSER(CNhaRLC 0 于 汪 Set Starting conditions asS untj thbhere are DO UDProcessed elements left in Link. Rather than continuajly 
本 0 0] ; 局 = [2 3]; % 3 no0des W/2 unprocessed anning Link for unprocessed elements，however，huff2mat maintains a 
oundg = 0) tofind = length(map); s# Tracking variab1e6s cking array,called 1eft, which is initializedto [2, 3] and updated to contain 


主 


while length(left) & (found < tofind) the indices of the link elements that have not been examined. 


10oK = find(strcmp(map，code{left(1)]}))， % IS String in map? Table 8.3 shows the Link table that is generated for the Huftman code in 

2 L00K 名 YeS ample 8.2. If each link index is viewed as a decoding state, ji, each binary 
link(Jeft(1)) = -Lo00k; #% Polint to Huffman map ding decision (in a left-to-right scan of an encoded string) and/or Huffman 
2 Delete CUrrent node jecoded output is determined by Link(I): 
found = found + 十 ; 各 Increment codes found 名 

el1Se % NO，add 2 nodes & pointers 了 i Link(i) <0(ie,negative), a Hufftman code word has been decoded. 
len = ljength(code ) ， % Put pointers in node ” jlhe decoded output is | Link(i) 1, where | | denotes the absolute value. 
Tink(Ieft(1)) = len + 1 2. 开 link(i) >0f(ie,positive) and the nextencoded bit to be Processed is a 
Link = [link; 0; 0]; % Add unprocessed nodes 0, the next decoding state ls index Link(i).Thatilis,weleti=Iink(iy)， 


3 IfLink(i)>0andthe nextencoded bit to be processed js a l,the next de- 


code{fend + 1} = strcat{code{fleft(1)}，'07)， 
coding state js index Link(i) +1.Ihatis,i=1link(i) +1， 


code{tend + 1} = Strcat(codefleft(1)}， 1 ) 


Jeft = left(2:end); % Remove processed node 
jeft = [Jeft len + 1 len + 2]; 先 Add 2 Unprocessed nodes TABLE 8.3 
ccoding tabje 
for the SoufTce 
x = unravel(y,code'，Link，m * n)， % Decode Using C :Unravel' reduction cell 
x=Xx+xmin 一 1 % X minimnum offset adjust aITay in FI8. 8.3. 
Xx = reshape(Xx，my， mn) %5 Make Vector an array 2 


As indicated earlier, nuff2mat-based decoding is built on a series of bina 
searches oOT two-outcome decoding decisions. Each element of a sequentiajly 
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FIGURE 8.5 

Flow diagram for 
C function 
UnPravel. 





Start with 





Completely 
decoded? 






Start over with 
nm = 0 


Output 
|1Linkt(n)i 


Yes 
Link(n) < 0 









Found an 
Output yet? 





No 
Jink(n) > 0 





What is the 
Dext encoded 





n = 1Linktny) 





As noted previously, positive Link _ entries COfrespond to binary decoding 
transltions, while negative entries determine decoded output values. As eack 
Huftman code word is decoded, a new binary search is started at Link inde 
of 下 xample 8.2, the resujlting staft 
transition sequence is = 1,， 3，1， 忆 下 5e coOfTresponding output: 


= 1.For encoded string 101010110101. .， 


SeqUence 1S 一 ，|=-2| 本 一， |-3| 一 


in conjunction with Table 8.3. Note， however. that modifications are neededto 
compensate for the fact that C arrays are indexed from 0 rather than 1]|. 

Both C and Fortran functions can be incorporated into MATLAB and 
Serve two malin purposes: (1) They allow large Preexisting C and Fortran PrIo- 
grams to be called from MATLAB without having to be rewritten as M-files， 
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时 (2) they streamjine bottleneck computations that do not run fast enough 


MATLAB Mr-files but can be coded in C or Fortran for increased efficlency. -EMCAe 
hether 人 OFr Fortran js used, the resujting functions are referred to as 2 
已 Thilesi they behave as though they are M-files or ordinary MATLAB 4AM4TL4B 


EtKiernel Arazctiop 
Drodiced Fopl C or 
ortran Sotrce codte 
Payapiet 让 Pa- 
Gepermdeni etrie1lsion 
({e.8， .dl1 r 
Hadom9). 


ctions. Unjike Mr-files, however, they must be complled and linked using 
[AILAB s mex SCTipt before they can be called.To compile and link unravel 
fa Windows Platform from the MATLAB command line Prompt for exam- 


e, We tyPe 
> mex Unrave1l.Cc 


MEX-fle named unravel1.d11 with extension .dl11 will be created Any 

help text, f desired, must be provided as a separate M-file of the same name 人 
Will have a .m extension). 

The source code for C IPAX-jie unravel has a .c extension and follows: 





AReC5S5otrce code 
1Secd io Dr ae 


了 一 一 一 一 二 一 一 全 二 二 二 二 三 二 王 二 二 一 工 二 二 二 二 三 三 二 二 二 二 二 二 二 工 宕 全 二 三 二 二 兰 三 二 全 一 一 一 一 一 一 一 一 一 一 一 一 一 一 一、 
一 人 一 一 一 一 一 一 一 一 一 一 一 一 二 二 二 二 二 三 二 二 全 三 二 二 二 二 二 二 二 二 二 三 二 全 一 二 二 二 三 三 二 全 二 二 一 全 一 


Unravel.C 0 

pecodes a variable length coded bit sequence (a Vector of 

16-bit integers) Using a binary Sort from the MSB to the LSB 

(across Word boundaries) based on a transition table. 

= 了 ===================================*/ UnmravelT,c 
We 


01d unravel(unsigned Short *hx，double *JinkK，doub1e xx ， 
doub1e xsz，int hxsz) 
nt lz=15， j=0k=0n=0; /* Start at root node，1St */ 
/* hx bit and X element >*/ 
/* Do until X is filled */ 
/” Is there a 1Link? */ 
1 工 S Di 3 二 2/ 
/* Yes，get new node */ 
/* It'S0 So get newW node */ 
计 (1) 1--; else {jt+;i 过 = 15;) /* Set ij to next bit */ 
if (j > hxSsz) /* Bits left to decode? */ 
miexErrMsgTxt( "Out of Code bits 人 


While (xszZ 一 kK) { 
计 (*(link +n) >0) 1{ 
if ((*(hx + j) >> 1) & 0x0001) 
n= (Link + ny) 
elSen= *(link + n) 一 1; 


{ /* It must be a leaf node */ 
“(X + k++) = 一 *(Link + nm)， /* Output value */ 
= 0; } /* Start Over at root */ 


if (k == XSZ - 1) / IS one 1eft Over? */ 


*(X + K++) = -~*(]ink + nn); 


int nrhs，const mxArray *prhs[]) 


double *1link，*X，XSz; 
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unrayel， 四 


区 


The companion help text is Provided in M-hle unravel,nm: 


Ge oo dc oo oo ce cb oo co coe o co 


Unsigned Short *hX 
int hxSz; 


/* Check inputs for reasonab1eneSSs */ 
if(nprhs !5= 3) 

mexErrMsgTxt( "Three inputS required，) ; 
else if (nlhs > 十) 

meXxErrMsgTxt( "Too many output argumentS， ) ; 


/* IS 1last input argument a SCalar7? *1/ 
计 (ImxISsDoubjle(prhs[2]) | mxIsComplex(prhs[2]) 
mxGetN(prhs[2]) * mxGetM(prhs[2]) !5= 1) 
mexErrMSsgTxt( "Input XSIZE must be a SCcalar，) 


/x* Create input matrix pointers and get Scalar */ 
hx = mxGetPr(prhs[0] ) ; /* UINT416 */ 
1Link = mxGetPr(prhsE1]); /xDOUBLE */ 
xsSz = mxGetScalar(prhs[2]) ; /* DOUBLE */ 


/Get the number of elLlementS in hx */ 
hxsSz = mxGetM(prhsI0] ) 


/xx Create 'XSzZ X 1 output matrIx ”1/ 
plhs[0] = mxCreateDoubleMatrix(xSz，1，mXxREAL ) ; 


/* Get C pointer to a Copy of the output matrix */ 
Xx = 前 XGetPrf(pLhsEO] ) ; 


/* Call the C Subroutine */ 
UnravelL(hx，Link，Xx，XxSZz，hxSzZ) ; 


UNRAVEL Decodes a variable-length blt strean. 
X = UNRAVEL(Y，LINK，XLEN) decodes UINT16 input Vector Y based on 
transition and output table LINK， The elements of Y are 
considered to be a contiguous Stream of encoded bitSs--i,e.，the 
MSB of one element followSs the LSB of the previouSs element， 
XLEN is the number code words in Y，and thus the Size of output 
Input LINK is a transition and Output 


Vector X (Class DOUBLE) . 
table (that drives a series of binary Searches) ; 


1，LINK(0O) is the entry point for decoding，i.e.， 
2， If LINK(n) < 0，the decoded output is |LINKIn 
> 0，get the next encoded bit and transition to 
state [LINK(n) - 1] ji 计 the bit 1s 0，elSe LINK(n) ， 


3，If LINKIn) 


Like allCMEX-files.C MEXc-file unravel,c consists of two distinct Parts: 
aco1puialional rotiineandagatemway rouline.The computational routine， ajsa 


State 


》 





= 0， 


































jamed unravel, contains the C code that Implements the 1ink-based decod- 
Drocess of Fig. 8.$. The gateway Ioutine,，which must always be named 
sxFunction, interfaces C compnutational routine unPravel to Mr-file calling 
metion huff2mat, It uses MATLAB's standard MEX-file interface, which is 
ed on the toljlowing: 


.Four standardized input/output parameters 一 nLhs,plhs,nrhs,and prhs. 
These parameters are the number of jeft-hand-side output arguments (an 
integer), an array of Pointers to the left-band-side output arguments (all 
MATLAB arraysj, the number of right-hand-side input arguments (an- 
other integer), and an array of pointers to the right-hand-slde Input argu- 
Iments (also MATLAB arraysj, respectively 

AMATLAB provided set of 4PPplication Programa 1nterjface (APUD func- 
fions. API functions that are Prefixed with mx are used to create, acCCceSsS， 
manipulate,and/or destroy Structures of cljass mxArray. 上 or example， 


se fxXCal1loc dynamically allocates memory jike a standard CC calloc 
function. Related functions include mxMal1oc and mxRealLloc that are 
Used in place of the C maaloc and realioc functions. 

e。 fxGetScalar extracts a Scalar from input array prhs. Other mmXGet . . . 
functions, Like mxGetM,，mxGetN, and mxGetString, extract other typPes 
of data. 

se fxCreateDoub1leMatrix creates a MAILAB 汪汪 arTay for plLhs， 
Other mxCreate... functions, jike mxCreateString and mxCreate- 
NumericArray, facilitate the creation of other data types. 


API functions prefixed by mex perform operations in the MAILAB 
environment， For example，mexErrMsgTxt outputs a message to the 
MAILAB workspace. 


1 wwwvrira 昌 


Function prototypes for the API mex and mx roOutines noted in Item 2 of the 
list are maintained In MATLAB header files mex， h and mat FiXx .h， 









Input 
















eters that they use and provide valuable clues about their general operation. 
dditional information is available in MATLAB?S 尼 xteraal nterjfaces refer- 
ence manual， 
Figure 8.6 Summarizes the preceding discussion, detalls the overall struc- 
re of C MEX-file unravel,and describes the fiow of information between if 
and M-file huff2mat. Though constructed in the context of Hufftman decod- 
Ing, the concepts illustrated are easiy extended to other CC- and/or Fortran- 
ased MATLAB functions. 


和， 
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-XArray 
SmXGet ,, ， 
-XCreate ..， 





由 ixXCal1LeCc 





:exXErrMsgTxt 
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Because the gray levels of the images are not equally probable, variable_] Symboli 个 hb 
coding can be used to reduce the coding redundancy that would result fronm se FlGURE 8.8 和 
natural bjinary coding of their pixels: 0 We 
coding model: 
>> fl = imread('Random Matches.tif' ) ; 0 Nearest (a) encoder and 
>> C1 = mat2huff(f1) ; Integer (bj decoder. 
>> entropy(f1) ia 
ans = 
7.4253 
>> imratiof(f1，c1l) Decompressed 
image 
ans = 
、 1.0704 
>> f2 = imread('Aligned Matches.tif' ) ， 
>> CC = mat2huff(f2 ) ; Predictor 





>> entropyfTf2 ) 


ans = 
7.3505 


>> lmratio(f2，C2) 


coder and decoder, each contining an identical preaictor. As each successive 
l of the input image, denoted 太 ,is introduced to the encoder, the predic- 
ior generates the anticipated value of that pixel based on some ntumber of past 
inputs. The output of the predictor is then rounded to the nearest integer, de- 
ited 思 , and used to form the difference or prediction error 


CE = 广 四 


Same 《7.4253 and 7.3505 bits/pixel); they are compressed Similarly Which ls coded using a variable-length code (by the symbol encoder) to gener- 
tte the next element of the comPpressed data stream.The decoder of Fig.8.8(b ) 


mat2huff (with compression ratios of 1.0704 versus 1.0821). These observa 
Hons highlight the fact that variable-length coding is not designed to take ad econstructs en f 们 om the received variable-length code words and performs the 
让 Tverse Operation 




















ans = 
1.0821 


Note that the firstrorder entropy estimates of the two images are about { 


Fig. 8.7(c). Although the pixel-to-pixel correlations are more evident in that 户 =e + 站 
， 刀 六 天 


arlous local, global, and adaptive methods can be used to generate 户 . In 
most cases, however, the prediction is formed by a linear combination of 7 


the information carried by individual pixels is relatively small. Much of the vi 
Tevious PiXels.That is， 


Sual contribution of a single pixel to an image is redundant' it could have been 
guessed on the basis of the values of its neighbors. These correlationg are the 
underlying basis of interpixel redundancy. 

In order to reduce interpixel redundancies, the 2-D Pixel array normally 
used for human viewing and interpretation must be transformed into a moOTe 
efficient (but normally“nonvisual”) format. For example, the differences be- 
tween adjacent pixels can be used to represent an image. Transformations of 
thls type (that is, those that remove interpixel redundancy) are referred to as 
/MI0DPI18S. TIhey are called reversiple mappingy ifthe original image elements 
can be reconstructed from the transformed data set 

A Simple mapping procedure is ilustrated in Fig. 8.8.The approach, called: 
losSsless Predicfiye codinpg， eliminates the interpixel redundancies of closely 
Spaced pixels by extracting and coding only the new information in each pixel.， 
The new zajoraalion of a pixel is defined as the difference between the actual 
and predicted value of that pixel. As can be seen the SyStem consists of an 


并 
人 cound| 2 mi- 

;= 
here mm is the order of the linear predictor, “round” is a function used to de- 
ote the rounding or nearest integer operation (like_ function _ round in 
ATILAB),and the wj for = 1,2,... 7 are prediction coefficients. For 1-D 
Jear predictive coding, this equation can be rewritten 


了 
方 xyY) = round| 人 aij(xz, y 一 | 
{ 二 
Where each subscripted variable is now eXpressed explicitly as a function of 


Spatial coordinates x and y Note that prediction jx y) is a function of the 
Previous Pixels on the current scan line alone. 
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prediction Coefficients in F and the asSumption 0f 1-D 10SS18SS 
predictive coding, If F is omitted，filter F = 1 (for previous 
pixel coding) is asSumed ， 


M-functions mat21pc and Ipc2mat implement the predictive encoding ani 
decoding Processes just described (minus the Symbol coding and decodi 
steps) .Encoding function mat21pc employs a for loop to build simultane 
ly the prediction of every Pixel in input x. During each iteration, xs, which ! 
gins as a COPY of x,is shifted one columan to the right (with zero Padding used 
on the left), multiplied by an appropriate Prediction coefficient, and added 访 or(nargchkt1，2，narglin 让 1 《heck Input argumentS 
Predliction Sum p. Since the number of linear prediction coefficients is norm ， argin < 2 % Set default fiiter if omitted 
ly small the overall process is fast. Note in the following listing that 让 pred 二 1， 
tion fijter f is not specified, a single element filter with a_ coefficient of 


See also MAT2LPC， 





Used. f(end:-1:1); % Reverse the filter CoefficientS 
. n] = Size(y); Get dimensions 0f OUtput matrIix 
0 0 站。 有 | er = length(f) ; % Get order of Linear predictor 
ee CompresseS a matrix USlng 1-D 108S165S predictive Coding repmat(f，m，1) % Dupjiicate filter for vectorizing 
5 MATEEPC(X， FF) encodes matrix X using 1-D 10Ssless predictive- zeros(m，n + order)， % Pad for fst 'order' column decodes 
5 coding. A linear prediction of X is made based on the 本 和 
% Coefficients in F, If F is omitted，F = 1 (for previous pixel ecode the Output one CoLunmn at 3 二 Ime， Compute a prediction based 
各 Coding) is assumed，The prediction error is then computed and n the order previouSs elements and add it to the prediction 
%5 output as encoded matriIx Y， Pror， The resujt is appended to the output matrix being built， 
%5 jj=12n 
%5 9368e 81S0 LPC2MAT. jj = j + orderi 
xf(:，jj) =y(:，j) + round(Sum(f(:，order:-1:1) ,* ..， 
errortnargchk(1，2，nargin) ) ; %% Check input arguments x(:，(jj -= 1 人 :1:(jj -= order))，2))， 
if nargin < 2 5 Set defaujlt filter if omitted 
人 
end X( order + 1:end) 5%5 Remove Jeft padding 0 
X = double(x); #% Ensure double for Computations Consider encoding the image of Fig. 8.7(c) using the simple first-order lin- 开 XAMPLE 8.5: 
[m，n] = Size(X); % Get dinmensSions of Input matrix af predictor Lossjess 
P = zeros(tm，Pn) % Init Jinear prediction to 0 predictive coding， 
XS = Xj zc = zeros{m，1); % Prepare for input shift and pad ， jx,y) = roundlaAxzy 一 1 了] 
for j] = 1:Jlength(f) % For each filter Coefficient ..， redictor of thlis form commonjly is called a previors Pirel predictor and the 
xS = [zc xs(:，1:8nd -1)]; % Shift and zer0o pad Xx sponding predictive coding Procedure is referred to as differemtial coding 
P=Ppr+Tf(j)*Xxsi s Form partial prediction Sums reyviors pixel coding. Figure 8.9(a) shows the prediction error image that 
Sm sults with a = 1. Here, gray level 128 corresponds to a prediction error of 0， 
yY=X~ round(p); %5 Compute the prediction erFor -sp 
下 
Decoding function 1pc2mat performs the inverse operations of encoding 14000 FIGURE 8.9 
counterpart mat21pc.As can be Seen in the following listing,it employs annl (a) The Prediction 
. 12000 error image for 
eration for joop, wherenis the number of columns in encoded input matIix ! 了 ee Si 
Each iteration computes onljy one coilumn of decoded output x, since each d 2 本 10000 9 训 
coded column is required for the compnutation of all supsequent coiumans， To: 1 8000 (b) Histogram of 
decrease the time spent in the for loop, x is preallocated to its maxima | 1 6000 the prediction 
padded Size before starting the loop. Note ajlso that the computations em 和 人 4000 error 
ployed to generate predictions are done in the same order as they were 也 于 让 二 
Lpc2mat to avold foating point round-off erTor， 二 0 
四 0 
3 人 -6 6 8 
1pc2mat function X = LIpc2mat{y， 二 ) 2 x 104 
Re %LPC2MAT DecompresseSs 8a 1-D 10ossless predictive encoded matrix， 


%5 X=LPC2MAT(Y，F) decodes input matrix Y based on Lineanr 
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while nonzero positive and negative errors (under- and Overiestimates) 
scaled by mat2gray to become lighter or darker shades of 8ITay TeSpective 





















>> 车 = imread(' ALigned Matches.tif' ) ; 
>> ee mat2Jpcf(Tf) ; 

>> 1Lnmshow(mat2gray(e) ) ; 

>> entropy(e) 


日 


Danse the intormation itself is not essential for normal visual processing, Since 
hzelimination of psychovisually redundant data results in a loss of quantita- 
information, it is cajled dxazm6izatiom. This terminology is consistent with 
normal usage of the word, which generally means the mapping of a broad 
e of Input values to a limited number of output values. As it is an irre- 
ible operation (i.e., visual information is lost), quantization results in lossy 


ta compTresslon. 


ans = 
5.9727 


Note that the entropy of the prediction rror, e, is Substantially lower 
the entropy of the original image, f.The entropy has been reduced fro 
/1.3305 bits/pixel (computed at the beginning of this section) to 5$9 
blts/Pixel, despite the fact that for mm-bit Images，(7a 十 1)-bit Dumbers: 


Consider the Images in Fig, 8.10. Rigure 8.10(a) shows a monochrome EXAMPLE 8.6: 
age with 230 gray levels. Figure 8.10(b) is the Same image after uniform ”Compression by 


neeced to Tepresent accurately the resulting error Sequence. This reductigl， fantization to four bits or 16 possible levels. The resulting compression ratio ”quantization， 
In entropy means that the prediction error Image can be coded moree 1 Note that false contouring is present in the previously smooth regionas 
ciently that the original image 一 which, of course .is the goal of the mappi e Orlginal Image. This is the natural visual effect of more coarsely repre- 
Thus, we get ing the gray levels of the image. 
Figure 8.10(c) ilustrates the significant improvements possible with quanti- 
>> C = mat2huff(e) ; ation that takes advantage of the pecujliarities of the human visual system.AL- 
>> Cr = imratio(f，C) fiough the compression resulting from this second quantization also is 2 :1， 
c 广 = 和 se contouring 1s greatly reduced at the expense of some additional but less 
1.3311 bjectionable graininess. Note that in either case, decompression is both un- 
ssSatry and Inpossible (ie.,quantization is an irreversible operation)， 。 吕 
and see that the compression ratio has，as expected, increased from 1.08 
(when Huffman coding the gray levels directly) to 1.3311. 
The histogram of prediction error e is shown in Fig. 8.9(b) 一 and compnute 光村 
as folljows: FlGuRE 3 10 
>> [hn，x]j = hist(e(:) * 512，512)， 
>> figurei bar(x，h，'Kk'); (b) Uniform 


quantization to 
16 levels. (c) TGS 
quantization to 
106 levels. 


Note that it is highly peaked around 0 and has a relatively small variance 
comparlson to the input images gray-level distribution [see Fig, 8.7(d)]. Thi 
reflects, as did the entropy values computed earjier,the removal of a great dea 
of Interpixel redundancy by the prediction and differencing process. We co 
clude the example by demonstrating the iossless nature of the predictive co 生 
ing Scheme 一 that is, by decoding c and comparing it to starting image f: 


>> 9 = Lpc2mat(huff2mat(c) ) ; 
>> Compare(f，9g) 
ans = 

0 
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quantize 
3 








TD coOmzpDare Sr108 
S1 and S2 避 HOri8 
C4586 HSe S = 
stprcmpi(sSf，sS2). 































JTJhe method used to Produce Fig. 8.10(c) jl$ called zzPFroved 87Gy-sC 
UGS) gxamtization. It recognizes the eye's inherent Sensitivity to edges 
breaks them Up by adding to each pixel a pseudorandom number， whic 
generated trom the low-order bits of neighboring Pixels, before quantizing 让 
TesSult. Because the low-order bits are fairly random, this amounts to addi 
level of randomness (that depends on the local characteristics of the image: 
the artificlial edges normally associated with false contouring，Fun 
quantize, listed next,， performs both IGS quantization and the traditio 
jow-order bit truncation, Note that the IGS implementation is vectorized 
that input x 18 PrIocessed one column at a time. TIo generate 3a column of : 
4-bit result in Fig.8.10(c), a column sum s 一 initially set to all zeros 一 is form 
as the sum of one column of x and the four least significant bits of the existin 
(Previously generated) sums. 过 the four most significant bits of any x valiue : 
11il, however, 00002 is added instead.The four most significant bits of 也 
Sujtimg Sumas are then used as the coded Pixel values for the column be 
PITOcessed. 


SSk d.They usually entail a decrease in the image's Spatial and/or gIay-Scale 
ution, 开 the image is frst mapped to reduce interpixel redundancies， 
ver, the quantization can lead to other types of Image degradation 一 ]ike 
人 urred edges (high-frequency detail loss) when a 2-D frequency transform is 
jito decorrelate the data. 


Although the quantization used to produce Fig. 8.10(c) removes a great 
if psychovisual redundancy with littie impact on Percelved image quajllity， 
her comPpITession can be achieved by empioying the techniques of the pre- 
Is twWo Sections to reduce the resulting image's InterpIxel and coding redun- 
ancies，In fact，we can more than double the 2:1 compression of IO 
uantization alone.The following Sequence of commands combines IOGS quan- 
ion, lossless predictive coding,and Huftman coding to compress the lmage 
jg.8.10(a) to less than a quarter of its origlnajl Slze: 





imread( :Brushes .tif ' ) 
0 4， 1gS ) 
= doub1le(q) / 16; 

= mat21pc(qs) ; 


function yY = quantize(X，b，+type) 

%QUANTIZE Quantizes the elLements of a UINT8 matrix， 

%%5 YY= QUANTIZE(X，B，TYPE) quantizes X to B bits. Truncation 1iS 
s used unless TYPE Is 1i1gSs' for Improved Gray Scale quantization， 


error(nargchk(2，3，nargin) ) ; % Check input argunmentSs 
if ndims(x) -= 2 | -~isreaLl(x) | ,,， 
~ISnumeric(x) | ~isa(Xx，'uint8') 
error( The input must be a UINT8 numeric matrix，) ; 
end 


#%5 Create bit masks for the quantization = 和 


1]0 = Uint8(2^ (8 一 bj) -1); = Lpc2mat(ne) ; 
hi = Uint8(2 8- double(1o) -1T); 16 * ngqs ; 
mpare(q，ng') 


光 Perform standard quantization Un]leSS JIGS is Specified 
if nargin < 3 | -Strcmpi (type， 'i1gS ') 
yY = bitand{(x，hi); 


名 ELSe IGS quantization，Process COLumn-wise， I the MSB's of the 
5%5 pixel are all 1'S，the Sum 1iS Set to the pixel value，E1sSe，add 
% the pixel Value to the LS8'SsS of the previous Sum，Then take the 
多 NMSB SS of the Sum as the quantized Value ， 


rmse = compare(f，ngq) 


6.8382 


e1Se 
[m，n] = Size(X); s = zeros{(m，1); \ote that the root-mean-square error of the decompressed image isS abott 7 
hitest = double(bitand(x，hi) ~= hi); x = double(x); ylevejls 一 and that this error results from the quantization Step alone， ”区 
for j] = 123:n 
S=Xxf:，j)+hitest(:，j) .* doublel(bitand(uUint8(S) ，10)) JPEG Compression 
:，]j) = bitand(uint8(Ss 
本 ee he techniques of the previous sections operate directly on the Pixels of an 
end mage and thus are Spafial doaazrn je 纺 ody. In this section, we consider a fam- 


Jy of popular compression standards that are based on modifying the trans- 


Improved gray-scale quantization is typical of a large group of quantizatior 
procedures that operate dlrectly on the gray levels of the Image to be coI 相 mage coOmpression, to provide additional examples of how to reduce the 
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了 XAMPLE 8.7: 
Cormbining TIGS9 
quantization with 
lossjless predictive 


and Huftman 


coding. 
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3 

FIGURE 8.1] 
JPEGO block 
diagrarmm: 

(aj encoder and 
(Pb) decoder. 





























本 人 和 | 
i 于 


image redundancies discussed in Section 8.2 through 8.4, and to give tbe reai 











er a feel for the state of the art in image compression. IJhe standards Presente [6 11 10 16 24 1 40 51 ; 161 0 开 5 6 4 15 | 27 | 28 
(although we consider only apProximations of themy) are designed to handl I2 12 14 19 26， 58 60 5 人 16 26129 | 
。 ， i 
wide range of Image types and compresslon Tequirements. ee -一半 一 一 本 网 | 一 - 
In 1ra1asjp71z coding,a reversible, linear transform like the DFT of Chapter 人 口 5 多 相 0 
or the discrefe cosine rramsjfora (DCT) 1417 2 29 5 87; 80| 2 | (9111j18124131 40， 44 53 | 
广 -一 一- 一 -一 一 -一 下 一 一 ve 一 一 一 一 -一 一 一 一 一 一 一 
M-1N-1l (2x + 1)zrr (2y + 1)o7r 18 | | 瑟 ] 3 56 68 |109， 103| 77 10|119， 23 ， 32 | 39 45 152 | 54 
7T(u,y) = 了 (本 [CO 人 和 
2AMf 2A 4 35 55 | 64 81 1104113 9 20 | 22 | 33 | 38 46 1 51 55 60 
外 一 一 一 一 二 一 2 -人 一 一 - 人 -一 4 > 
j 1 | 
where 加 4 已 四 2 _ 0 4 37 人 20| 1 
1 72 | 2 .1 2 2 100 103| | 35 36 | 48 和 57 | ES 62 | 63 ， 
和 
可 AM 
ct) 二 7 omputed. Thbe resulting coefficients are then Simultaneously normalized and 
有 二 uantized in accordance with 
后 和 了 (全 ) 
[and similarly for a(2)] is used to map an image into a set of transform COef 伍 7(z,V) = round 序 CE 
cients, which are then quantized and coded. For most natural images, a sign 许 
cant nUmber of the _ coefficients have Smaill magnitudes and can be COarsely ere 7( 1， 7 for 1 1 三 0， 1 ...,7 are the Tesulting normalzed and quan- 


zed coefficients, 7(x, yj is the DCT of an8gx8block ofimage AF(x,y), and 
(OO) is a transfprra moralization ar7ay ke that of Fig. 8.12(a). By scaling 
(2 avariety of compression rates and reconstructed image qualities can 
e achleved. 

After each block's DCT coefficients are quantized, the elements of 7(u， 也 ) 
are reordered in accordance with the zigzag Pattern of Fig. 8.12(b). Since the 
Sulting one-dimensionaliy reordered array (of quantized coeffticients) is 
ualitatively arranged according to _ increasing Spatial frequency, the Symbol 
icoder of Fig. 8.11(a) is designed to take advantage of the long funs of zeros 
at normajlly result from the AN In barticular, the nonzero AC coe 人 fi- 
ents [ie., al 7( V) except & = = 0] are coded using a variable-length 
de that defines the coefficient'"s 41d Dumber of preceding zeros. The 
C coefficient [ie.， 7(0， 0)j is difference coded relative to the DC coefficient 
the previous subimage.DefaultAC and DC Huffman coding tables are pro- 
ded by the standard,but the user is free to construct custom tables as well as 
全。 .00Imallzation arrays, which may in fact be adapted to the characteristics of the 
Image being compressed. 

While a full implementation of the JPEG standard is beyond the scope of 
1 chapter,the following M-file approximates the baseline coding Process: 


quantized (or discarded entirely) with little image distortion . 


.3 JPEG 
One of the most popular and comprehensive continuous tone, still frame Com 
PrIesSlon standards is the JPEG (for Joint Photographic Experts Group) stan: 
dard. In the JPEG paselinre coding System which js based on the discrete cosine 
transform and ls adequate for most compression applications, the input and 
output images are limited to 8 bits, while the quantized DCT coefficient values 
are restricted to 11 bits. As can be seen in the simplified biock diagram of 
Fig. 8.11(a),the compression itself is performed in four sequential steps: 8 X 8 
subimage extraction，DCT computation，quantization，and variable-length 
code asslgnment， 

The first step in the JPEG compression Process js to subdivide the input 
Image Into nonoverlapping pixel blocks of size 8 X 8. They are subsequently 
Processed left to right, top to bottom. As each 8 Xx 8 block or subimage is 
Processed, its 64 pixels are level shifted by subtracting 277~1 where 2m is the 
Tumber of gray levels in the image, and its 2-D discrete cosine transform 这 
















jnput |j8 X 8 block Normaljlizer/ Symbol Compressed 9 nction Y = im2jpeg(X， quUality) 
image extractor quantizer encoder image JW2JPEG Compresses an image USing a JPEG approximation， 
Y = JIM2JPEG(X，QUALITY) compresses image X based on 8 X 8 DCT 
transforms，Ccoefficient quantization，and Huffman Symbol 
下 coding，Input QUALITY determines the amount of information that 
Eee | Denormalizer 人 is lost and compression achieved, Y is an encoding structure 
image decoder DCT meTger 1mage 
containing fields: 
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FIGURE 8.12 
(aj The defanult 
JPEG 
normalization 
array. (b) The 
JPEG zigzag 
coefficient 
ordering 
Sequence. 


Im2jpeg 
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和 Y,SIze Size of X (count + 1):end) = []; 和 Delete Unusued portion of 『 

名 Y.numblocks Number of 8-by-8 encoded blocks 

反 Y.quality Quality factor (asS percent ) RE 1 xn] ) ; 

。 Y,huffman 机 Structure，as returned by quality = uintt6(quality * 100)， 

网 huffman = hat2huff(r) ; 
% See al1SO JPEG2IM， 


In accordance with the block diagram of Fig. 8.11(a), function im2jpeg 


ee | 0CcesSes distinct 8 X 8 sections or piocks of input image x one block at atime 
error('The input must be a UINT8 image.')|; ather than the entire image at once). Two Specialized block processing func- 
end tions 一 bJKproc and im2col1- 一 are used to simpjlify the computations. Function 
if nargin < 2 blkproc, whose standard SyntaX is 
quality = 1 % Default value for quality， 2 
end B = blikproc(A，EM N]，FUN，P1，P2，..，,)， 
= [1611 10 16 24 40 51 61 % JjPEG normalizing array 


treamjlines OF automates the entire process of dealing with images in blocks.It 


12 12 14 19 26 58 60 55 % and zig-zag redordering 
accepts an input Image A， along with the SIZe (IM N]) of the blocks to be 


14 13 16 24 40 57 69 56 5 Pattern， 
14 17 22 29 51 87 80 62 

18 22 37 56 68 109 103 77 

24 35 55 64 8 104 113 92 

49 64 78 87 103 321 420 101 

72 92 95 98 142 100 103 99] * quality | 


tional 1 parameters PT1，P2，... for block processing function FUN. Func- 
ion blkproc then breaks A Into MXN blocks (inciuding any zero that 


and reassembles the resujts into output image B. 


order = [19 2 3 140172518114 5 12192633 .,， The second specialized block Processing function used by im2jpeg is func- 
41 34 27 20 13 6 7 14 21 28 35 42 49 57 50 ..， ion im2col, When blkproc is not appropriate for implementing a specific 
43 36 2922158 16233037 44 .51 58 359 52 lock-oriented operation, im2col can often be used to rearrange the input so 
45 98 31 24 32 39 46 53 60 61 54 47 40 48 55 …: hat the operation can be coded in a simpler and more efficient manner (e.g.， 
6< 6563 56 64] allowing the operation to be Vectorized)j. The output of im2col is a matrix 

[xm，xn] = Size(X) ; % Get input Size， in Which each column contains the elements of one distinct block of the input 

x = double(x) - 128; % Level Shift input image. Its standardized format is 

t = dctmtx(8) ; 站 Compute 8 X 8 DCT matrix 


% Compute DCTS of 8x8 blocks and quantize the CoefficientSs. B = lm2col(A，[MNj，'distinct ' ) 


y = blkproc(X，[8 8]，':P1* X* P2 ，Tt Tt )， 

yY = blkproc(ty，[8 8]， round(x ,/ p1)， 交 和 3 here parameters A, B, and [MN] are as were defined previously for function 
y = in2colly，[8 8]，'distinct'; % Break 8x8 blocks into columns 1lkproc. String 'distinct' tells im2col that the blocks to be processed are 
xb = sizely，2); % Get number of blocks onoveIlapPling; alternative string 'SLiding' signajls the creation of one col- 
y = yl(order，?:); % Reorder column elementSs mn in B for everIy pixel in A (as though a block were slid across the image). 


In im2jpeg,function blkproc is used to facilitate both DCT compnutation 
nd coefficient denormalization and quantization，while im2col is used to 
mpjlify the quantized coefficient reordering and zero TUn detection. Unlike 


Create end-of-block Symbol 


oo 


eob = max(x(:)) + 1 
F = zeros(numel(y) + Size(y，2)，1) ; 


Count = 0 

for j = 人 %5 Process 1 block (col) at a tine he JPEG standard, im2jpeg detects only the final run of zeros in each re- 
i = max(find(y(:，j))); % Find 1ast non-Zzero element ordered coefficient block, replacing the entire run with the single ecob Symbol， 
if isempty(i) % NO nonzero block ValuesS Finally we note that although MATLAB provides an efficient FFTbased 

工 二 : 昌 function for large image DCTS (refer to MATLAB' help for function dct2)， 

end ne2jpeg uses an alternate matrix formulation: 
p = COount + 了 
9=pPr+iILi T = HRHT7 


Truncate trailing 0'SsS，add EO0B， 
and add to output vector 


rtp:q) = [yY(11，]) 8eobj 
Count = Count + 工 + 
end 


Where Fisan8 x 8block ofimage F(xz, y),Hisan8 x 8DCTtransformation 
matrix generated by dctmtx(8), andTis the resulting DCT of 了 Note that 


oo 品 
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7D comzpHie ipne 万 CT 
Off 六 8 X 8HoHnove 广 
Iapping DiocKs ring 
纺 e Qt OPerafion 
h*fxh ,ierh = 

过 ctmtXx(8). 
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Get X COolLumns 

Get X FowsS， 

Huffman decode， 

Get end-of-block Symbol 


Form block columns by copying 


5Z(2) ， 

sZ(1) 
huff2mat(y,huffman ) ; 
= maX{X(:) ) 


the “is used to denote the transpose operation. In the absence of quantizatio 
the InveIse DCT of 工 ls 


F = HZTH 


JThis formujation KM particularly effective when transforming Small square in 


omoo 加 ooce 





ages (like JPEG's8 X8PDCTSs).Thus,the statement ii ee 
if X(K) == 60b to the next CoLlumn Whenevenr 
yY = blkproc(x，[8 8]，PI*Xx*P2，h，h') k = k+ 1 break; an E08B Symbol is found， 
eJSe 
computes the DCIS of imagexin8 X8blocksusingDCTtransform matrix z(i，j) = Xx(K)， 
and transpose h'， as Parameters P1 and P2 of the DCT matrix maultiplicatigi k= kt+1i 
人 end 
Slimijar block processing and matrix-based transformations [se 
Fig.8.11(b)j] are required to decompress an im2jpeg compressed image. Fun' 
tlon jpeg2im, listed next, performs the necessary Sequence of inverse Opef: (rev，:) | % Restore order 
tions (with the obvious exception of quantization). It uses generic function -CoJ2im(z，[8 8]，[fxm xn]，'distinct' ); % Form matrix blocks 
Xi= blkproc(x，[8 8]，'X ,* P1'，m); 点 Denormalize DCT 
人 dctmtx(8) ; %5 Get 8X8 DCT matrix 
ne ee [8 8]，'P1* xx pP2，t ti 光 Compute block DCT-1 
Uint8(Xx + 128) ; 和 Level Shift sse 
jpeg2in function x = jpeg2im(y) 

0 名 jPEG2IM Decodes an IM2JPEG compressed image ， tfofe-create a 2-D image from the columns of matrix z where each 64-element 
5 X 二 JFE62IM(Y) decodes Compressed image Y，generating lumn is an 8 X 8 block of the reconstructed image,. Parameters A, B,[M N1]， 
et approxamation xx Y 1S a Structure generated by and distinct' are as defined for function im2col,while array [MMNN] spec- 
he dimensions of output image A. 

5 See alsS0 IM2JPEG， 


igures 8.13(a) and (b) show two JPEG coded and Subsequently decoded EXAMPLE 8.8: 


error(nargchk(1，1，nargin) ) ; 多 Check input argumentSs proximations of the monochrome image in Fig.8.4(a).The first result, which JPEG 
nmn=[lI611 10 16 24 40 5 6 % JPEG normalizing array Tovides a compression ratio of about 18 to 1, was obtained by direct appjica- ”compression. 
12 12 14 19 26 58 60 55 % and zig-zag reordering on of the normalization array in Fig. 8.12(a). The second, which COIDPTreSsSes 
14 13 16 24 40 57 69 56 % _ pattern， original image by a ratio of 42 to 1, was generated by multiplying (scaling) 


14 17 22 29 5!1 87 80 62 
1]8 22 37 56 68 109 103 77 
24 35 559 64 8 104 113 92 
49 64 78 87 103 121 120 101 
72 92 95 98 112 100 103 99] ; 


he normalization array by 4. 

jhe differences between the original Image of Fig. 8.4(a) and the recon- 
tructed images of Figs. 8.13(a) and (bj are shown in Pigs. 8.13(c) and (dj, re- 
ctively. Both images have been scaled to make the errors more visible.The 
resSponding rms erTors are 2.5 and 4.4 gray levels.The impact of these errors 


order = [49 2 3 10172518114 5 12192633 .,,， sn picture quajlity is more visible in the zoomed images of Figs. 8.13(e) and (人 . 
41342720136 7 1421283542 4957 50 .,.， aese images Show a magnified section of Fipgs. 8.13(a) and (bj, respectively， 
43 36 29 22.15 8 16 23 3037 44 51 58 59 52 .,， allow a better assessment of the subtle differences between the recon- 
4538312432394653960615447 404855 .…. shucted images. [Figure 8.4(b) shows the zoomed original.] Note the PiocKirg 
3 交 9 sortijacr that is present in both zoomed approxlimations. 
nn 5 《ompute :nyverse ordering The images in Fig.8.13 and the numerical results just discussed were gener- 
FE 他 T 二 于 hofopdar 0 站 with the following sequence of commands: 
end f = imread('Tracy.tif' )， 
由 = double(y.quality) / 100 * mi % Get encoding quality， > cf = im2jpeg(f) ; 
xb = double(y,numblocks) ; % Get x blocks， 11 = jpeg2im(c1) | 


SZ = double(y,size)， ”jimratio(f，c1) 
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FIGURE 8.13 Lett 
column: 
Approxliimations 
of Fig.8.4 using 
the DCT and 
normallization 
arITay of 

Pig. 8.12(a)j. Right 
coljumn: Similar 
TesSults with the 
normalzation 
arTay Scaled by a 
factor of 4. 
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im2jpeg(T，4) 
“f4 = jpeg2lm(c4) ; 
imratio(f，Cc4) 






ompare(fT，f4，3) 


6 TIesujts difter from those that would be obtained in areal JPEG baseline 


tios noted above would be approximately doubled. 蝎 


， 


Eee jdea that the coefficients of a transform that decorrejates the Pixels of an 
uage can be coded more efficiently than the original pixels themselves. If the 



























Wavejlet NormalizeI/ Symbol Compressed 
transform quantizer encoder Image 
汪 1 InVerse Reconstructed 
wavejet transform Image 
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as 


和 
FIGURE 8.14 
JPEG 2000 block 
diagTarmn: 

(aj encoder and 
(b) decoder. 
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FIGURE 8.15 
JPEG 2000 two- 
Scale wavejlet 
transform 
coefficient 
notation and 
anajysis galn 

(in the circles). 





































上 


iits used to Tepresent the original image and the analysis gain bits for subband 
让 Subband analysis gain bits follow the simple pattern shown in FIig. 8.15. For 
xample, there are two analysis gain bits for subband = 1 忌 

Or elIrorfree compression, Up = 0 and R = 8s) SO that A = 1. For irre- 
Wersible compression, no particular quantization step size is Specified. Instead， 
he number of exponent and mantissa bits must be provided to the decoder on 
bband basls, calljed explicit grantizatiom, or for the Ni ZL subband Only， 
ed IDLiCi gUaP6ZatOH. In the latter case, the remaining subbands are 
antized using extrapolated 从 忆 忆 subband parameters、 Letting so and Ho be 
ie number of bits allocated to the NM 世人 subband, the extrapolated parame- 
1 for subband p are 


compnuted. For error-free compPresslon, the transform used 1S biorthogonal， 
with a 9-3 coefficient scaling and wavejlet vector. In lossy applications, a 9-7 < 
efficient Scaling-wavelet vector (See the wavefilter function of Chapter 
employed. In either case.the inlitial decomposltion resujlts jn four subbands 
low-resolution apPpITOXimation of the Image and the Image'S horlzontal, ve 
cal,and diagonal frequency characterlstics. 
Repeating the decomPosition process Ar times, with Subsequent tteratioi 
restricted to the Previous decomposition s approxlmation coefticlents，PT 
duces an Ar -scale wavelet transform. Adjacent Scales are related Spatially 妓 
powers of 2,and the lowest Scale contains the only expllcltjy detined approx 
mation of the origlinal Image.As can be SUurmised from Fig. 8.13, where the 
tation of the standard 1S Summarlized for the case of Ni =2，a gener 
Ar-scale transform contains 3AMi + 1 subbands whose coefficlents are deng 
ed ap for = Ni ZLLN DLL 1ECL LE TIE The standard does 所 
specify the nurmber of scales to be computed. 
After the Au -scale wavelet transform has been computed, the total numt 
of transform coefficients 1S equal to the number of samples in the orig 
Image 一 but the ImPportant visual Information 18 Concentrated in a few coe 
clients. Jo reduce the number of bits needed to represent them，coeffici 
ap) of subband p is quantized to value gp(z, wj) using 


/5 ”0 
E 太 三 -60 二 jp PSan 


Where /sd5 denotes the number of subband decomposition levels from the 
iginal image to Subband D. The final step of the encoding pIOocess 1$ to code 
quantized coefficients arithmetically on a bit-plane basis. Although not 
ussed in the chapter, aritppzpaetic coding is a Variable-length coding proce- 
that, like Hufftman coding,is designed to reduce coding redundancy. 
Custom function im2jpeg2k approximates the JPEG 2000 coding process 
也 8. 8.14(a) with the exception of the arithmetic symbol coding. As can be 
nin the foljlowing listing,Hufftman encoding augmented by zero run-length 
ng is substituted for simplicity. 


寺 unction Y = im2jpeg2k(x，n，dq) 

JW2JPEG2K Compresses an image using a JPEG 2000 approximation. 

_Y = IMH2JPEG2K(X，N，Q) compresses image X using an N-Scale JPEG 
.2K wavelet transform，implicit or explicit coefficient 
-quantization，and Huffman Symbol coding augmented by zero 
-Pun-length coding， If quantization vector Q contains two 
“elements ， they are 3aSSumed to be implicit quantization 

- parameterSij elise，it is assumed to Contain explicit Subband step 
Sizes, Y is an encoding structure Containing Huffnman-encoded 
data and additional parameters needed by JPEG2K2IM for decoding， 


2 


number of bits aljjlotted to the exponent and mantissa of the Subband s coe 寺 ts 
clients, [he nominal dynamic range of subband pb ls the sum of the number 


-See also JPEG2K2TII， 
obal RUNS 


Tortnargchk(3，3，nargin) ) ， s Check input argumentSs 


) ) 
ndinms(X) -= 2 | ~isreal(x 
e 


| ~isnumeric(x) | ~isa(X，'uint8') 
-error('The input must b )， 


) 
a UINT8 image， 


ength(q) -= 2 & length(q) -== 3 * n+ 
rrort The quantization Step Size vector is bad.'); 


学 CU 落 一 了 本 
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imn2jpeg2KkK 


2 


2 aeromen -antyvanrAwna 
吕 
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% Level Shift the input and compute its Wavelet transfornm， 
x = double(x) - 128; 
[C，S] = Wavefast(X，n， jpeg9.7 ) 


点 Quantize the Wavelet Coefficients ， 


q = StepSlizetn，q) 

sgn = S1gnf{C) sgntffind(sgn == 0)) = 1 C = absf(c) ij 

for kK = 132n 
qi=3*rk 一 2; 
cC = Wavepaste('h'，C，S，K，Wavecopy( h' ，c，S，k) / q(qi)); 
C = Wavepaste( vV' ，C，S，K，Wavecopy( vV ，0，S， ki) /9q(qir+ 1)); 
C = Wavepastel( 'd ，Cc，S，K，Wavecopy{( d ，c，sS， k) /1 q(qi+ 2)); 

end 

C = Wavepaste('a'，C，S，k，Wavecopy( 'a ，Cc，S，k) / q(9i + 3)) 

C = 了 Loor(c); C 三 人 0 

% Run-1length code zero runs of more than 10，Begin by creating 

名 8 Special code for 0 runs ('zrc' ) and end-of-code ( 8e0oc ) and 


% making a run-length tabjle . 


zrCc = min(tc(:)) -1 eoC = Zrc 一 1; RUNS = [65535] ; 


% Find the run tpPansition points: “plus” contains the Index of the 
% Start of a zero Funi the el minus'” is its end + 1， 
Z=C== 0 Z=z~[0z(1:end -1T)]; 
plus = find(z == 1); Te find( 


% Remove any terminating zero Pun from 
if length(plus) -= Length(minus ) 

cftplus(end):end) = [| C = 
end 


-= 1) 


[c 8e0c] ; 


和 Remove all other zero runs (based on "plus'” and 'minus') from “C. 
for = length(minus) -1:1 
Fun = minus(I) -plLuS(I) ; 
if run > 10 
ovrflo = floorf(Pun / 65535) ; run = 
[cl(1:plus(i) - 1) repmat([zrc 1]， 
runcodelrun) cltnminus(I):end)]; 


Pun ~ ovrf10 * 65535 
1 DVPTTLO POCO< ws 


end 
end 
% Huffman encode and add misc. information for decodlng ， 
Y,Puns = Uint16(RUNS ) ; 
yY ,S = Unt16(S{(:) ) 
ye = UInt16{( 一 zZFc) ， 
Y .fd = Uint16(100 * 9 ) 
y,n = Ulintt6tn) ; 
Y,huffman = mat2huff(c) 
和 和 
function Y = Puncode(x) 


5 Find a zero run in the run-lLength table， If not found，create 3 


% new entry in the tab1e，Return the index of the Prun， 
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find(RUNS == X); 
engthty) ~= 1] 

RUNS = [RUNSi， X] 
) = Jength(RUNS ) ; 


ction q = Stepsize(n，p) 

reate a Subband quantization array of step Sizes ordered by 
ecomposition (first to Jast) and Subband (horizontal，Vvertical， 
iagonal，and for final decomposition the approximation Subband) , 


length{p) == 2 % ImpJicit Quantization 


dk =2“~KkK* gqni 
= [qt(2*gk)tkt2*gk) (4*qk)] 
end 
= [q qk]; 
5 EXplLicit Quantization 


光 5 Round to 1/100th Place 
error( The quantizing steps are not UINT16 representabJe，); 


ny{ 二 二 ) 
rror( A quantizing step of 0 is not allowed. ) ; 
本 


EG 2000 decoders simply invert the operations described previously. 
er decoding the arithmetically coded coefficients, a useT-Selected number 
e Original Image's Subbands are reconstructed.Ajthough the encoder may 
Ve arlithmetically encoded Ms bit-planes for a particular subband,the user 一 
due to the embedded nature of the codestream 一 may choose to decode only 
planes. This amounts to quantizing the coefficients using a step Size of 
6 A Any non-decoded blts are Set to zero and the resulting coef 人 fi- 
nts, denoted 多 (ui oj, are denormalized using 





( 殉 ( 克 可 二 2 5 A5 刺 ( 玫 轨 >0 
Ra 人 信 关 人 ( 殉 (02) 一 2 人 -As 砚 ( 本 人 二 0 
0 02) = 有 0 


ere Rob 2) denotes a denormalized transform coefficient and Ap(u V) is 
number of decoded bit-planes for gp(z vv). The denormalized coefficients 
ethen inverse transformed and level shifted to yield an approximation of 
eoriginal image. Custom function jpeg2k2im approximates this process, re- 
erIsing the coOmPTresslon of im2jpeg2k introduced earlier. 


避 23 
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jpeg2k2im function X = jpeg2k2im(y) mpute the inverse Wavelet transform and level shift， 
it %JPEG2K2IM Decodes an IM2JPEG2K compressed ;image， aveback(c，s，'jpeg9.7'，n) | 
% X = JPEGQ2K2IM(Y) decodes compressed image Y，reconstructing a iint8(X + 128) ; 
%%5 approxinmation of the original image X， Yis an encoding 
和 e princlpal diffterence between the wavelet-based JPEG 2000 System of 
。 .14 and the DCTbased JPEG system of Fig. 8.11 is the omission of the 


oO 


See alSo IM2JPEG2K， 
rror(nargchk(1，1，nargIn)); %5 Check input arguments 


53 Sublmiage procesSing Stages. Because wavelet transforms are both com- 


Get decoding parameters: Scale，quantization vector，run-length 


e 

先 

nd-of-data code，wavelet bookkeepir . . | 
ae Rn locking artifact that characterizes DCT-based approximations at high 
n 


double(y.n); omipresSion Iatios. 


qd = double(ly.9) / 100; 0 有 和 . 
Puns = double(y.runs) ; Figure 8.10 shows two JPEG 2000 approximations of the monochrome image EXAMPLE 8.9: 
rlen = length(runs); 时 Rig,8.4(a). Figure 8.16(a) was reconstructed from an encoding that compressed “JPEG 2000 


Zrc = -double(y.zrc) 

eoc = zrc 一 1; 

Ss = doublely.s); 

S = reshape(Ss， n+ 2，2); 


eoriginal image by 42:1; Fig. 8.16(b) was generated from an 88 :1; encoding， compression. 
yo results were obtained using a five-scale transform and implicit quantiza- 
th Ho 二 8 and so = 8.35 and 7,respectively Because im2jpeg2k only ap- 
ioximates the JPEG 2000s bitrplane-oriented arithmetic coding，the 
务 Compute the Size of the Wavelet transfornm， 
cl = prod(s(1，:) ) 
for TI = 2:n+1] 
二 DoS 2 
end 


2000 encoder. in fact, the actual rates would jncrease by a factor of 2. 

ce the 42:1; compression of the results in the left column of Fig. 8.16 is 
tical to the compression achieved for the images in the right column of 
8.13 (Example 8.8),Figs. 8.16(a),(c),and (e) can be compared--both qual- 
ely and quantitatively 一 to the transform-based JPEG results of Figs. 
b),(d),and (f).A visual comparison reveals a noticeable decrease of error 
the wavelet-based JPEG 2000 images. In fact, the rms error of the JPEG 
000-based result in Fig. 8.16(a) is 3.7 gray levels, as opposed to 4.4 gray Jevels 
:the corresponding transform-based JPEG resuit in Fig. 8.13(b). Besides de- 


泊 Perform Huffman decoding followed by zero _ run decoding， 
FF = huff2mat(y.huffman ) ; 


国医 zi1=TfTindtr == ZrCc) | 1 = 1 

for j = 1:length(zi) 
C= [crti:zl(j) 一 1) zeros(1，runs(r(zi(j) + 1)))]; 
工 


= zi(j) + 2; asing Teconstruction error，JPEG 2000-based coding dramatically in- 
end eased (in a subjective sense) image quality This is Particularly evident in 
zi = find(r == 8eoc); 生 Undo terminating zero_run .8.16(e). Note that the blocking artifact that dominated the coOIresponding 
if length(zi) == % or last non-zero run. form-based result in 了 ig. 8.13( is no jlonger present. 


ER hen the level of compression increases to 88:1 as in Fig.8.10(bj, there 


C= [czeros(1，cL - length(c))]; joss of texture in the woman's clothing and blurring of her eyes. Both ef- 
61Se cts are Vlslble in Figs, 8.16(b) and (人 . The rms error of these reconstruc- 
C=f[fcri:end)]; 


S ls about 3.9 gray levels.The resujts of Fig. 8.16 were generated with the 





em oOwing sequence of commands: 
% Denormalize the coefficients . 
C=Ccr+(c>0)- (cs<0); = imread(' Tracy.tif' ) ; 
for k = 13n = lm2jpeg2k(f，5，[8 8,5])， 
qi=3rxrk 一 2; = jpeg2Kk2im(cl) ; 
C = Wavepastel h ，C，S，Kk，Wavecopy( h' ，c，S，k) * g(9qIi)) rms1 = Compare(f，Tf1) 
C = Wavepaste('v':，c，S，k，wavecopy('v':， cs，k) * q(qi + 1) 二 
C = Wavepaste('d' ，cC，S，K，Wavecopy('d'，c，S，Kk) * q(qi + 2))5 3 .693|1 


end 


cC = Wavepaste('a'，c，S，K，wWavecopy('a'， 6，S，K) > q(qi + 3)); “crl = imratio(f，c1) 
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3a PEB 

Cd 

e-: 于 

FIGURE 8.16 Letft 
column:JPEG 
2000 
approximations of 
Fig.8.4 using five 
scajes and implicit 
quantization with 
Ho 一 8and 

= 8.3. Right 

column': similar 


results with 
20 一 人 




















42.1589 


im2jpeg2ktf，5，18 7]) |; 
jpeg2Kk2im(c2) ; 
= Compare(f，f2) 


5,.9172 
r2 = imratio(fT，Cc2) 


87.7323 


要 ee they must be ordered by decomposition level (first， sec- 
， third,...) andby subband type (ie.,the horizontal, vertical, diagonal, and 


Im2jpeg2k(f，1，[I1 1 1 4] 7 


omputes a one-scaje transform and employs expjlicit quantization 一 all four 
Ubbands are quantized using step Size Al = 1.That js, the transform coeffi- 
lents are IOounded to the nearest integeI. TIhis is the minimal error case for 
he im2jpeg2k imnplementation, and the resulting rms error and compres- 
ion rate are 


f3 = jpeg2k2imf(c3) ; 
PFAms3 = Compare(f，Tf3) 


1 .1234 

cr = limratio(f，Cc3) 

3 = 

1.6350 瑞 


e ematerial in this chapter introduces the fundamentals of digital Image coOmPrIesslon 
AIough the removal of coding, interpixel, and psychovisual redundancy MATLAB 
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cornerstone of the mathematical set of tools underlying the development of 
niques that extract“meaning”from an imasge. Other approaches are de- 
oped and applied in the remaining chapters of the book. 


[天 Preliminaries 


thjs Secuon We introduce some basic concepts from set theory and discuss 
application of MATILABYs logical operators to binary Images. 


. 5ome Basic Concepts from Set Theory 


et Z be the set of integers. The sampling process used to generate digital im- 
geS may be viewed as partitioning the xy-plane into a grlid, with the coordi- 
es of the center of each grid being a pair of elements from the Cartesian 
oduct， Za In the terminology of set theory a function F(x, y) issaidtobea 

di8gttal id8e 这 (人 )) are integers from Zand 六 is a mapping that assigns an 

tensity Value (that is, a real number from the set of real numbers， 尺 ) to each 

stinct pair of coordinates (zx, 7). 入 the elements of R also are integers (as is 

ually the case in this booKk), a digital image then becomes a two-dimensional 

nction whose coordinates and amplitude (Le., intensity) Values are integers. 

JIet 4 be a set in Z“, the elements of which are pixel coordinates (2x, y). 下 

(z, yis an element of 4,then we write 





PP7eTTez0 


The word morppology commonly denotes a branch of biology that deals Wi 
the form and structure of animals and plants.We use the same word here in 也 | 
context of aipzemaatical 1aomphology as a tool for extracting image compo 
nents that are Useful in the representation and description of region shape 
such as boundaries, skeletons, and the convex hull, We are interested also if 
morphological techniques for pre- Or postprocessing, such as morphological 
hltering, thinning,and pruning. 
In section 9.1 we define several set theoretic operations, introduce binary 
Images, and discuss binary sets and logical operators. In Section 9.2 we define 
two fundamental morphological operations, diiaiiom and erosiom in terms 0| 
the unlon (or intersection) of an image with a translated Shape (SIrUCILE8 寻 
eme1b. Section 9.3 deails with combining erosion and dilation to obtain moOIe 
complex morphological operations. Section 9.4 introduces techniques for 1a: 
bejling connected components in an image. This is a fundamental step in eX: 


EL 

Inilarly, 过 is not an element of 4, we write 

区 4 

set 下 of pixel coordinates that satisfy a particular condition is written as 
中 = {faolcondition} 


Or exampje, the set of all pixel coordinates that do not belong to set 4, de- 
oted 4 is given by 


4 = {2l 如 ec4)} 


S Set ls called the comzplerzet of 4. 


element, as js the case in Sections 9.1 through 9.4. Section 9.6 extends mOrpho- 
loglcal concepts to gray-scale images by replacing set union and intersection 
with maxima and minima. Most binary morphological operations have natural 
extenslons to gray-Scale processing. Some, like morphological reconstruction， 
have applications that are unique to gray-scale images, such as peak filterling: 

The material in this chapter begins a transition from image-pProcessing 
methods whose inputs and outputs are images, to image analysis methods; 
whose outputs in Some way describe the contents of the image. Morphology is 


the set of al elements that belong to either 4, B, or both. Similarly the 
[ersecfion of two sets 4 and B is the set of all elements that belong to both 
Sets, denoted by 






e Cartesian product of a set of integers, Z, is the set of all ordered pairs of elements (zi 2) with zi 
:nd zi being integers from Z. Itis customary to denote this set by Z2 
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abc 

Ge 
FIGURE 9.1 
(aij JTJwo sets 4 
and 刀 .(b) TIhe 
unlion of 4 and 妃 . 
(c) The 
Intersection of 4 
and DB.{d)The 
complement of 4 . 
(e) The difference 


betweena 4 and 瑟 . 


二 二 
FIGURE 9.2 

(a) Iransiation of 
4 by z. 

(b) Reflection of 
.TIihe sets 4 and 


妈 are from 
Fig, 9.1， 
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Binary Images, 9ets, and Logical OPperators 


language and theory of mathematical morphology often Present a dual 
2w of binary Images.As in the rest of the book,a binary image can be viewed 
bivalued frrction of x and /. Morphological theory views a binary Image 
ihe set of its foreground (1-valued) pixels, the elements of whjich are mn 冯 ” 
perations Such as unlon and intersection can be applied directly to binary 
age Sets. For exaimple, 让 4 and 有 are binary images， then C=A4UBialso 
ary image, Where a PlxelinCis aforeground pixel if either OF both of the 
esponding pixels 记 4 and 呈 are foreground pixels. In the first view,that of 
ction, C is glven by 





及 门 召 





1 于 either 4(xz y) or 到 (xc yi1lor 直 both are 1 


0 0 otherwise 





ing the set Vilew, on the other hand,C is given by 


C={xzrn(zys4orlzyesBorlye(4andD)} 


ES 《4 


Jhe dzjerence of sets 4 and B, denoted 4 -- Bisthe set ofal elements 姓 
belong to 4 but not to 如 : 


ie set opeIations defined in Fig.9.1 can be performed on pinary images using 
MATLAB's Iogical operators OR (|), AND (由 ,and NOT (-), as Table 9.1 
， As a Simple ijustration, Fig. 9.3 Shows the results of applying several logical 
Tperators to two binary images containing text. (We follow the IPT conven- 


4 一 了 ={lws4weB5} 


Pigure 9.1 illustrates these basic set operations.The result of each operationi 
Shown in gray. 

in addition to the preceding basic operations， moOrphological operati 
often require two operators that are Specific to sets whose elements are PE 


内 


coordinates,. IJhe repecrion of set 下 denoted 有 ,is defined as ig, 9.3(e)] consists of the Pixels where the jetters in“UTK”and“GT”overT- 


Finally, the set difference image [Fig, 9.3( 介 ] shows the letters inm“UTK” 

有 = {olw = -bforpe 万 } ith the pixels“GT"”removed. 

The tanmsiation ofset 4 bypoint z = (zl, zzj, denoted (4),,is defined as 
(4)j:={fclc=a+zforas4} 


Figure 9.2 齐 ustrates these two definitions using the sets from 站 The 
black dot identifies the origim assigned (arbitrarily) to the sets. 


MAILAB 世 xpression 
for Binary Jmages 







Set Operation 





AND 
OR 

NOT 
DIFFERENCE 






A18 
灵 -A 





(4): 


?9.2 江 Dilation and Erosion 


TABLE 9.1 
Using logical 
eXPIeSSIOnS 1 
MATLAB to 
Perform set 
Operations on 
binary 1mages. 
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人 a hb 
上 和 C 
站 | 
和 
有 FICURE ?4 
人 Jiustration of 
和 dilaton. 
本 人 和 (ai Original image 
有 1 with rectangular 
用 :1 Object. 
人 -人 (b) Structuring 
村 全 全 element with five 
上 Pixels arraneed in 
a dagonal line. 
The structuring element translated to The origin of the 
these locations does not Overlap any structuring 
1-valued Pixels in the original image. element is shown 
with a dark 
border. 
(C) StrUcturing 
eljement 


trfanslated to 

Seyeral jocationas 

1 on the Image. 

> When the origin is (d) Output imnage. 

| translated to the 
“”!ocations, the 


守 [ L(j 人 





-oem 和 -一 -- 一 和 ee 


3 b CC imasge. 
中: 全 江 

: : 1 下 /于 下 下) 下) 人 

FIGURE 9.3 (a) Binary image A. (b) Binary image 8B. (c) Complement -A. (d) Union A 」B. (e) Intersectioi 1 ， ， ee 

A&B.( 世 Set difference AR& -B. } 必 1 人 年 汉 和 放 

上 SO 人 

时 直人 

3 Dilation 是 

和 111 1 全 

Diatom is an operation that“prows”or'“thickens” objects in a binary image 人 

The specific manner and extent of thjs thickening is controlled by a shape re ee | ww ee 

、 1 让 ; 

ferred to as a St1rtucttrizpg elemazertt Figure 9.4 jllustrates how dilation works mnODOiltlllllonnfinhuoa 

Figure 9.4(a) shows 3 Simpie binary image contalning a rectangular object 人 

Figure 9.4(bj is a structuring element, a five-pixel-long diagonai line in this 人 

人 


《entitied Figure 9.4(b) shows the origin of the structuring element using ev tion of 4 by Bisthe set consisting of all the structuring element origin loca- 
0nS where the reflected and translated 妃 overlaps at least some portion of 4. 
Jhe translation of the structuring element in dilation js simijlar to the mechan- 
cs of spatial convolution discussed in Chapter 3. Figure 9.4 does not show the 
Tucturing element's reflection explicitly because the structuring element is 
ymmetrical with respect to its origin in this case. Figure 9.5 shows a nonsym- 
netric structuring element and its reflection . 

_ Dilation is comzzzatiative:that is, 4 约 忆 = 有 四 4.Itisaconvention in image 
Processing to jet the first operand of 4 人 旨 8B be the image and the second 


and checks to see where it overlaps with 1-valued Pixels. Ihe output image In :全 
Fg. 9.4(d) is 1 at each location of the origin such that the Structuring element 
overlaps at least one 1-valued pixel in the Input image， 

Mathematically dilation is defined in terms of set operations. The dilation 
of 4 by B,denoted 4 征 Bis defined as 


498={zl(8).nm4>g) 
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竺 王 


FIGURE 9.5 
Structuring 
element 
reflection. 

(a) Nonsymmetric 
StructurIng 
element. 

(b) Structuring 
element reflected 
about lts oOTlg1n. 


了 入 AMPEE 9.1: 
A slmple 
apPlication of 
dilation. 


人 
PP 


习 钼 地 Ar 

5 局 
人 二 
2 


过 inate 
AR 


Sb 

FIGURE 9.6 
AsSimple example 
of dilation， 

(a) Input Image 
contalining broken 
text.【b) Dilated 
image. 























gtructuring Element Decomposition 


币 jation js CSyOciatyve. TIhat is， 


4@(B@C) =(4@B)@C 


)p0S6 that a structuring element 妃 can be Tepresented 3S 3 dilation of two 
cturing elements B1 and D2: 


已 = PP1 人 外 D 


on 4 国有 = 4@(5 申 忆 ) = (4 四 及) 四 杞 . 年 other words, diiating 人 
iih Bis the same as first dilating 4 with Bl, and then dilating the result with 
Wesaythat 了 can be decomapo5ed into the structuring elements B1 and 已 >， 
he assocjative Property is important because the time Tequired to com- 
te dilation is Proportional to the number of nonzero PIxejls in the StTUCctuIing 

ent. Consjider, for example, dilation witha3 X 9 arTay oft 1S: 


operand be the structuring element, which usuajly is much smaller than 机， 
image. We follow this convention from this point on. 


滤 IPT function imdilate performs dilation. its basic calling Syntax 1S 


A2 = indilate(A，B) 


where A and A2 are binary images,andBis amatrix of 0s and ls that specifie 11 1 了 虐 工 
the structuring element. Figure 9.0(a) shows a sample blnary image containin 句 1 1 1 1 
text with broken characters. We want to use imdilateto dilate the Image Wi 1 1 1 1 
the structuring element: 111T 11 
1 0 本 业 业 寺 
] 0 s structuring element can be decomposed into afive-element row of 1s and 


five-element column of 1S: 


The following commands read the image from a fiie, form the structuring ele 
ment matrix, perform the dilation, and display the result， 


>> 人 扩 imread( ' broken text+t.tIf ) | 
>>B= [010;111; 010]，; 

>> A2 = lmdillate(A，B) 

>> 1ImSsShow(A2 ) 


1 
ifitie 

T 

1 










enumDer ol elemeT 2.but the total 


Imber of elements In_the. ow-column decomposition js onjy 10. Ipls_means 
that qilation with the row structuring_element first, followed by dilation_with 
column element, can be performed 2.5 times faster tpan dlation with the 
X > arTTay of 1S. Im practice， the speed-up willbe somewhat jess because there 
iually some oOveIhead associated with each dilation operation, and at least 
dilation operations are Tequired when using the decomposed form. 这 Oow- 


er,the gain in speed with the decomposed implementation 18 stilj significant. 


Figure 9.6(D) Shows the resulting image. 


划 和 
上 1 ai 手 征 庆生 


Historically，certain Computer : 
programs were Written USing 
oniy two 由 gits rather than 
four to define the applicable 
Year。Accordingiy，the 
company's 50ftWare may 
recognize 3 date Using "00 

as 1900 rather than the Year 
2000. 


tjistorcaly，cRrtaitn compbuter 


Programs wete Written USIng 


Oniy twWa 中信 全 节日 拉 人 人 七 全 已 人 
fourtc define rpha appPicabie 
yeSr。，ACCOrdinOiY，te 

2.3 The strel Function 


function strel constructs structuring elements with a Variety of Shapes 
SlZzegs. Its baslc SYyntax 1S 


conpany's 58ftWBrG 站 是 多 
regeqnize 号 寺 atsa USinG 00- 
总 SS 圭 996 fat 的 有 than te Year 


200690. 


se = StrelLl(sShape，parameters ) 
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上 上 和 XAMPLE 9.2: 
Anliiustration of 
structuring 
element 
decomposition 
USIing stre1l. 


getsequence 







where shape js a string specifying the desired Shape, and parameters is 
of Parameters that specify information about the Shape, such as its size Fo 
ample, strel( 'diamond'，5) returns a diamond-shaped Structuring elem 
that extends 士 S pixels along the horizontal and Vertical axes. Table 92? 
marlzes the various shapes that strel can create 

In addition to simplifying the generation of common StTUcturing elem 人 @ 
shapes, function strel also has the Important property of producing strucftt 
ing elements in decomposed form. Function imdilate automaticaljy use 
decomposition information to speed up the dilation Process.The following 
ampjle llustrates how strel returns information related to the decompos 
of a structuring element， 





跨 Consider again the creation of a diamond-shaped structuring element USits 
strel: 二 


>> Se = Strel( diamond'，5) 

Se = 

Flat STREL object containing 61 neighbors， 
Decomposition: 4 STREL objects Containing a total of 17 neighbor 
Neighborhood : 


呈 呈 口 口 吕 一 口 口 口 口 惠 
呈 呈 吕 己 一 一 一 品 虽 口 吕 
呈 吕 忆 一 一 一 一 下 吕 口 口 
呈 呈 一 一 一 一 一 一 一 口 串 
忆 一 一 一 一 一 一 一 一 一 口 
呈 口 一 一 一 一 一 品 口 口 
口 口 口 口 一 口 口 口 口 虽 


Shaped pattern in this casej; the number of 1-valued Pixels in the structuring 
element (61); the number of Structurimng elements in the decomposition (4 


>> decomp = getsequence(se) | 


>> Whos 
Name S1LZze ByteSs ClLass 
decomp 4X1 1716 strel object 
Se 1X1 3309 strel object 


Grand total is 495 elements USing 5025 bytes 


Syntfax Forms 


strel( ' diamond ' ，R) 


se= strel('disk'，R) 













=SsStrel( 1ine'，LEN，DEG) 


= StPrel( ooctagon' ，R) 


se=Sstrel('pair'， OFFSET) 


ie= strel('periodicline'，P，V) 


8e=strel( rectangle'，MN) 


Se=Sstrel('Square'yW) 


se =Strel('arbitrary' ，NHOOD ) 
Se= Strel(NHOOD) 
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Descripfion 






Creates aflat, diamond-shaped 
Structuring element, where R specifies the 
distance from the structuring element 
ongin to the extreme points of the 
diamond. 
Creates aflat, disk-shaped structuring 
element with radius R.(Additional 
Parameters may be specified for the disk: 
See the strel help page for details.) 
Creates a flat, linear structurinpg element， 
where LEN specifies the length, and DEG 
Specifies the angle (in degrees) of the 
line, as measured in a counterclocKkwise 
direction from the horizontal axis. 
Creates a fiat, octagonal structuring 
element, where R specifies the distance 
from the structuring element origin to the 
sides of the octagon, as measured along 
the horizontal and vertical axes,R must 
be anonnegative Immultipie of 3. 
Creates a flat structuring element 
containing two members. One member is 
located at the origin.The second 
menmber's location is specified by the 
vector 0FFSET, which maust be a two- 
element vector of integers. 
Creates a flat structuring element 
containing 2*P + 1 members.V is atwo- 
elerment vector containing integer-valued 
IOow and column offsets. One Structuring 
element member is located at the origin. 
The other members are located at 1x*V， 
一 *V，2*V，-2x*V，..,，P*V,and ~Px*V， 
Creates a flat,rectangle-shaped 
Structuring elemaent where MN SPeclfies 
the size. MN must be a two-element vector 
of nonnegative integers.The first element 
of MN is the number rows in the 
structuring eljement; the Second element 
is the number of columns. 
Lreates a Square Structuring element 
whose width is W pixels.W must pe a 
nonnegative integer Scalar. 
Creates a structuring element of 
arbitrary shape.NHOOPD is a matrix of 
Us and ls that specifies the shape.The 
Second, Simpler syntax form Shown 
perftorms the Same operation. 


TABLE 9.2 

The various 
Syntax forms of 
function strel1. 
(CIbe word jar 
means that the 
Structuring 
element has zero 
helght. Thls js 
meaningful only 
for gray-scaje 
qilation and 
eroslon. See 
sectlon 9.6.1.) 
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The output of whos shows that se and decomp are both strel objects 3 
further, that decomp is a four-element vector of strel objects.The four struct' 
ing elements im the decomPposition can be examined individually by indey 


1 and extent of shrinking is controlled by a structuring element. Figure 9.7 计 - 
Into decomp: 


itrates the erosion process.， Figure 9.7(a) is the Same as Fig. 9.4(a). 
>> qdecomp (1 ) aphicajly depicts erosion as a PIocess of translating the structuring element 


angs = bughout the domain of the image and checking to See where L fits entirely 


Flat STREL object containing 5 nelghbors ， 


Nelighborhood : 600000000000000059 
0 1 0 作 再 人 9 
1 ] ] 让 人 人 
0 0 000000090900900001 
中 让 了 
>> decomp(2】 00000111111100000 | 
下 1 


ans = 


Flat STREL object containing 4 neighbors ， 0n0000000000000000 


有 人 和 站 日 有 有 0009909089909 










9 0 
0 1 0 
1 0 1 Output is zero in these locations because 
0 1 0 the structuring element Overlaps the 
background. 
>> decomp(3) 
1 有 \ | 二 | | 
ans = 国 副 量 本 
Flat STREL object containing 4 neighbors， 下 | 一 门 一 
Neighborhood : | 
0 0 0 0 一 人 | 
1 0 0 0 1 本 人 
O 0 O 0 0 se 人 
0 0 1 0 0 1Outputis one here because 
|， | thestructuring ejement fits ] 
>> decomp(4) [人 entirely witpin the foreground. 
ans = 上 -人 人 


和 0 


Flat STREL object containing 4 neighbors . | 


Neighborhood : 


人 站 和 人 
0 1 0 人 人 
1 0 1 站 DO011111110008 0 
0 1 O 站 


Function imdilate uses the decomposed form of a structuring element 3U- 0 人 
tomatically performing dilation approximately three times faster (s 61/17) 本 让 
than with the non-decomPposed form， 
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全 全 

FIGCURE 9.7 
Jiustration of 
eTOS1On， 

(al) Orliginal image 
with rectangular 
object. 

(bj Structuring 
element with 
three DPIXels 
arranged in 3 
vertical line, The 
orlgin of the 
structtring 
element 1S Shown 
with a dark 
border. 

(c) Structuring 
element 
translated to 
several ljocations 
on the lmage. 


(d) Output Image， 
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玉 XARNUDPLE 9.3: 
Anillustration of 
eroOSsion， 


3 世 : 

c- 人 

FIGURE 9.8 An 
illustration of 
etroslon. 

(a) Original 
Image， 

{(b) Erosion with a 
disk of radius 10. 
(c) Eroslon with a 
disk of radius 9， 
(d) Erosion with a 
disk of radius 20， 


anyofthe image background). 




























= imread( wirebond mask.tif') ; 
strel( disk ，10) 
A2 = imerode(A，Se) | 

mshowtf(A2 ) 


within the foreground of the image.。 The output imapge in Fig.9.7(dqibhas ava 
of 1 at each location of the orlgin of the Structuring element, Such that the et 
ment overlaps.On 1]-valued pixejs of the inpui Image ， 1L.e.lt doesTot oye 


Se 








The mathematical definition of erosion ls Simliar to that of dilation. TI 
erosion of 4 by B,denoted 4 日 B,is defined as 


468B={fzlHB):m 4 关 } 


In other words, eroslon of 4 by Bisthe set of all structuring elemaent origin 到 
caftions where the translated 下 has no overlap with the background of 4. 


strelf( disk' ，5); 
imerode(A，Se) ; 
imshow(A3 ) 


0 
“oo oo 
上 导 


器 Erosion ls Performed by IPT function imerode. SupPose that we want 
remove the thin wires in the image in Fig. 9.8(aj,but we want to PITeserve 
other Structures. We can do thls by choosing a structuring element Smah 
enough to fit within the center Square and thicker border leads but too largete 
fit entirely within the wires. Consider the following commands: 


e of the wire jeads were not removed in this case. Figure 9.8(d) shows 
thappens if we choose a structuring element that is too large: 


县 和 Combining Dilation and Erosion 


practical Image-pIocessing applications, dilation and erosion are used most 
n in various combinations. An image will undergo a series of dilations 
d/or erosions using the same, or Sometimes different， Structuring elements. 
his section we consider three of the most common combinations of dilation 
eroslion: Opening, closing, and the hit-or-miss transformation. We also in- 
Koduce lookup table operations and discuss bwmorph, an IPT function that 
an perform a variety of practical morphological tasks. 





Opening and Closing 





Ane maorpPpologica7 Opezi1g of 4b 
已， followed by dilatlon of tb&e resujt by_ 忌 - 


4 了 =(48) 引 8 
nalternative mathematical formulation of opening is 
4"8= U{(B)z(5).EAI) 


here LU {T} denotes the union of all sets inside the braces, and the notation 
5 D means that Cis a subset of D.This formulation has a simple geometric 
iterpretation: 4eBis the union of all translations of B that fit entirely within 
Figure 9.9 iustrates this interpretation. Figure 9.9(a) shows aset 4 and a 
sk-shaped structuring element 万 . Figure 9.9(b) shows some of the transla- 
ons of B that ht entirely within 4.The union of all such translations is the 
aded region in Fig. 9.9(c); this region is the complete opening.The white re- 
ons In this figure are areas where the structuring element coujd not fit 
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Chopter 9 选 Morphological Image Processing 
2 Translates of B inside 4 
人 
Translates of 如 
outside 4 
十 :全 
FIGURE 9.9 Opening and closing as unions of translated structuring elements. (a) Set 4 and stmu 
turing element 如 (b) Iranslations of 吾 that fit entirely within set 4. (c) The complete ope 
(shaded). (d) Translations of 已 outside the border of 4 《e) The complete closing (shaded)， 
This example iustrates the use of functions imopen and imclose. TIhe 
completely within 4, and, therefore, are not part of the opening, Morphol 油 ht 2 ee 2 有 | 0 和 es 站 
cal opening Iemoyss completely regions of an object that cannot contain 出 机 和 ee ne 
Structuring_ element smoothes object contours， breaks thin connections， and On ee 
removesihin protrusions 7 gllowing commands open the image with a 20 X 20 structuring elementt: 
The mzorpPpol ! ; 
2 08ical closi18g of 4 by 有 .denoted 4.Biisadilation followed f = imread('shapes.tif'); 
an So 1 1 
Se = Strel( Square ，20) 
4.B=(4@B)GB > fO = Imopen(Tf，Se) 
imSshow(Tfo ) 
Ceometrically 4 .Bis the complement of the union of all translations of 如 
that do not overlap 全. Figure 9.9(d) Hustrates several translations of B that 9 igure 9.10(b) shows the result. Note that the thin protrusions and outward- 
not overlap 4. By taking the complement of thpe union of all such translationg， ointing boundary irregularities were Temoved. The thbin join and the Small 
We obtain thbe Shaded Icegion if Fig. 9.9(e), which is the complete Likg ms0lated object were TeImoved also. The commands 
f 
Di > fc = imclose(f，se); 
fills holes smallsrtan cturing element > imsnow(Tfc ) 
Openling and closing are implemented in the toolbox with functions imopen 
and imcliose,. TIhese functtons have the Simple Syntax forms roduced the result in Fig. 9.10(c). Here, the thin gulf the inward-pointing 
3 oundary irregularities, and the Small hole were Iemoved. As the next parIa- 
SR CC = imopen(A，B) aph shows, combining a closing and an opening can be quite effective in re- 
moving noise. In terms of Fig. 9.10, performing a closing on the result of be 
and earlier opening has the net effect of Smoothing the object quite significantiy 
本 e close the opened image as follows: 
9S8 C = imclose(A，B) 







> foc = imclose(fO0，Se) ) 
where Als abinary Image andBis amatrix of 0s and ls that specifies the struc- > imshow(foc ) 


turing element. Asastrel object, SE, can be used instead of B. 
Higure 9.10(d) shows the resujlting smoothed objects. 
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二 hb 
心 柯 


FIGURE 9.10 
Illustration of 
opening and 
closing. 

(a) Original 
Image. 

(b) Opening. 

(c) Closing. 

(d) Closing of (b). 


卫 XAMPLE 9.4: 
Working with 


_ functions imopen 


and Imc1lose，. 
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这 a hb 
1 CC 
人 2 dae 
人 二 下 熙 
四 ] 多 
1 ] 
ee 全 和 时 上 二 人 下 汪 下 | FIGURE 9 ]2 
0091lt1l11600D6b800n0 ef 1 个 遇 放 汪汪 
0 1 人 (al) Original image 
Di 和 刘 和 4.(b) Structuring 
人 se = 
| ， 
RE 人 (c) Erosion of 4 
日 昌 和 有 7 bv 召 
全 ie 本 四 Complement of 
。 SA 。 ， 和 . - ， 
FIGURE 9.11 (a) Noisy fingerprint Image. (b) Opening of image. (c) Opening followed by closing. (Origin 冰 攻 人 二 和 人 


Image courtesy of the National Institute of Standards and Technology.) ] 开 
人 和 eiement 性. 


(上 电 Erosion of 4 
by 已 .(g) Output 


Figure 9.11 further illustrates the usefulness of closing and opening bya 
Image. 


plying these operations to a noisy fingerprint [Fig.9.11(a)].The commands, 


>> 千 = imread( fingerprint .tif') ， 二 
>> Se = Strel(' square'，3)， 二 
>> fo = imopen(f，se); 上 局 
>> imshow(Tfo) 有 本 
人 1 | 
。 人 二 于 于 
produced the image in Fig. 9.11(b). Note that DO1SY Spots were Iemoved | 111100ul1l11111111 而 
openlng the image, but this process introduced numerous gaps In the ridgesao 村 是 且 本 1 1 
1 


the fingerprint. Many of the gaps can be flled in by folowing the opening Witk 


a closing: 
人 
本 1 和 1 全 乡 nlT1LTIT1L1 
>> foc = jimclose(foyse); 人 
>> LImshow(Tfoc ) 上 
| 证 
Figure 9.11(c) shows the final result. 0 
1111a511111110101 
.3.2 The Hit-or-Miss Transformation 。 1 1 


or-mlss transformation of 4 by Bis denoted 4 凶 有 8 Here, Bisa structuring 人 
ejement pair, 已 = (有 1， B2),rather than asingle element as before The hit-or- 人 
IOn 1 1 人 个 和 :证 
Imlss transformation is defined in terms of these two Structuring elements 3S 1 ， Se 和 ， 。 
| (1 ft 所 1 

4 拘 肛 二 465)mm(4e 记 ) 时 

和 有 


Pigure 9.12 shows how the hit-ormiss transtormation can be used to identL-- 


ty the locations of the following cross-shaped pixel confi Urationi: 
Figure 9.12(a) contains this configuration of pixels in two different locations， 


0 1 0 TOSion With structuring element Bi determines the locations of foreground 
| plxejls that have north, east, South, and west foreground neighbors. Erosion of 
0 1T 0 he complement with structuring element 号 determines the locations of all 


he pixels whose northeast, southeast，southwest， and northwest nelghbors 
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h single-pixel dot in Fig.9.13(b) is an upper-left-corner Pixel of the objects 


belong to the background. Figure 9.12(g) Shows the intersection (log 光 
折 ig ,9.13(a). The pixels in Fig. 9.13(b) were enlarged for clarity， 油 


AND) of these two operations. Each foreground Pixel of Fig.9.12(g) is the 
cation of a set of pixels having the desired configuration. 

The name “hlit-of-malss transformation” ls based on Row the result is a 
ed by the two eroslons. 上 oL example, the output Image in Fig. 9.12 consists ben the hit- OI-ImI1SS structuring elements are Small, a faster way to compute 
all ljocations that match the Plxels in Bi (abhat”) and thatPpaveions 5 ED 

els in 刀 2 (a mlss .Strictly speakling, 及 记 a11d-111158 tra1sjfoPrzaatiopnis amaore 
“curate name, but Por-7aiss ZQNSjO77CUO7 1S Used more frequentiy. 

Jphe hit-ormiss transformation js lmplemented im IPT by fun 0 

bwhitmiss, which has the syntax 



































0 the output plxel value for every possSible neighborhood config- 
ion and then Store the answers in a table for later use. For instance, there 
2? = $12 different 3 X 3 configurations of pixel values in a binary Image. 
ob make the use of lookup tabjes Practical, we Imust assign a Unique index 
ch possible configuration.A simple way to do this for, say, the 3 X 3 case， 


4 = bwnitmnlSS(A，B1，B2) o mujtiply each 3 X 3 configuration element-wise by the matrix 





where Cls the result,Als the input image,and B1 and B2 are the structuring 
ements just discussed. ， 2 0 -128 
4 32 236 





EXAMPLE 9.5: 器 Consider the task of locating upperT-left-corner pixels of objects in an image then sum all the products. This procedure assigns a unique value in the 
ee using the hitrormiss transformation. Figure 9.13(a) shows a simple image con se [0,511] to each different 3 X 3 neighborhood configuration. For exam- 
ee talnling Square Shapes. We want to locate foreground pixels that have east an le the value assigned to the neighborhood 
South neighbors (these are“hits”) and that have no northeast, north, noOT 
West，wWest, OT Southwest heighbors (these are“misses”). These requireme 人 
lead to the two structuring elements: 1 0 1 
1 0 1 
en 10G) + 2(D + 4() + 8() + 16(0) + 32(0) + 64(0) + 128(1) + 
0 %96(1) = 399, where the first number in these products is a coefficient from 
N ; ， ， 了 e preceding matrix and the numbers in parentheses are the plxel values， 
ote that nelither structuring element contalins the Southeast nelghbor, whic 人 
1 0 0 1 6 人 人 9 几 0 The toojbox provides two functions, makelut and applylLut (illustrated 
人 ter in this section), that can be used to implement this technique. Function 
三 ;1 . makelut constructs a lookup table based on a user-SsuUppjied function, and :和 ma 
， Te 国人 pL1y1Uut Processes binary es using this lookup table. Continuing with < 
ie3 x 3 case,using makelut requires writing a function that acceptsa3 义 3 
a b mnary matrix and returns a Single value， typically either 3 0 OT 1 Function 
FIGURE 9.13 akelut calls the User-Ssupplied function 512 tmes, Passing each possibjle 
(a) Original 3 neighborhood.It records and returns all the results in the form of a >]2- 


ment vector. 

Asanillustration, we write afunction,endpoints.m,that uses makeJUt and 
pplLyLut to detect end points in a binary image, We define an enad POL a8 a 
breground pixel that has exactly one foreground Deighbor, 上 unctlon 
ndpoints computes and then applies a lookup table for detecting end points 
an input image. The line of code 


image. (b) Result 
of applyimng the 
hit-or-milss 
transformation 
(the dots shown 
were enlarged to 
facilitate viewing). 


persistent JUt 





sedin function endpoints establishes a variable called Lut and declares it to 
persiste1 凡 MATLAB “remembers”the value of persistent variables in be- 
en function calls.The first time function endpoints is called, varlable JUt 
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endpointSs 
ee 
JSee JeclO 了 42 jpr 


2cligctts3iom OF Perc- 
ion Pardcie Q. 


下 XAMPLE 9.6: 
Playing Conway”s 
Game of Life 
using binary 
Images and 
lookup-table- 
based 
computation. 
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1 automatically initialized to the empty matrix ({ ]). When lut is empty tha ab 
function calls makelLut，passing it a handle to subfunction endpoint_ fcrs FIGURE 9.14 
Function applylut then finds the end points using the locokup table, Ths (a) Image of a 
jookup table is saved in persistent variable lut so that， the _ next tim 攻 人 
。 SKelieton， 
endpojints 18 called, the lookup tabjle does not need to be recompnuted 全 
function 


function g = endpoints(f) 

<%ENDPOINTS Computes end points of a binary image.， 
名 G= ENDPOINTS(F) computes the end points of the binary image 下 
5 and returns them in the binary image G， 


endpoints. The 
PIXels in (b) were 
enlarged for 
ciarlty. 


perSiStent Jut 


if IsSempty(1Lut') 
LIut = makelut(eendpoint fcn，3) ; 
end 


g = applyjiut( 于 ，JLut) ; 


function is_end_point = endpoint fcn(nhood) 
betermines 计 f a pixel is an end point ， 
IS_END_POINT = ENDPOINT_FCN(NHOOD) accepts a 3-by-3 binary 
neighborhood，NHOOD，and returns a 1 评 the center element js 
end point; otherwise it returns 3a 0， 


ob ob8 cp 纪 


LSs_end_pojint = nhood(2，2) & (Sum(nhood(:)) == 2); 


Figure 9.14 贡 ustrates a typical use of function endpoints， Figure 9.14(a) 
a binary image containing a morphological skeleton (see Section 9.3.4) an 
Fig. 9.14(b) shows the output of function endpoints， 














To Implement the game of life Using makelut and appLylut, we first write 
function that applies Conway's genetic laws to a Single Pixel and its3 X 3 
elghborhood: 


疯 An interesting application of lookup tables is Conway's “Game of Life” 
whlch involves“organisms”arranged on a rectangujar grid. We include it here 
as another illustration of the power and simpjlicity of lookup tables. There are 
Simpjle rules for how the organisms in Conway's game are born, Suryvive， and 
die from one“generation” to the next, 人 binary image is a conyenient Tepre: 
Sentation for the game, where each foreground pixel represents a living OfgaI: 
isSm in that location. 
Conway's genetic rules describe how to compute the next generation (OF 
next binary image) from the current one: 


Unction Out = Conway1laws(nhood) 

CONWAYLAWS Applies Conway's genetic laws to a Single Pixel， 

0UT = CONWAYLAWS(NHO0D) applies Conway's genetic jaws to a Single 
pixel and its 3-by-3 neighborhood，NH00D， 


m_nelghbors = Sum(nhood(:)) - nhood(2，2)， 
nhood(2，2) == 
if num_neighbors <= | 

out > 0; 点 Pixel dies from isolation. 
elsejif num_neighbors > = 4 

out = 0; 各 Pixel dies from overpopulation， 


Conway1aws 


本 


1，Eyvery foreground pixel with two or three nelghboring foreground Pixels: 
SUIVIves to the next generation. 

2 和 Eyery foreground Pixel with zero, one, or at least four foreground nelg 
bors“dies”(becomes a background pixel) because of “isolation” of 


Out = 1; 各 Pixe]l SUPvives， 


“overpopulation ” if num_neighbors == 3 
3 Every background pixel adjacent to exactly three foreground neighbors it: out = 全 各 Birth Pixel， 

a birth”Pixel and becomes a foreground pixel， 1 t = 0; % Pixel remains empty。 
Al births and deaths occur simultaneously in the process of computing 也 end 


next blnary image depicting the next generation， 
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The ljookup table 1S constiructed next by calling makelut with a fung Drint Re TABLE 9.3 
handle to conway1laws: 人 Operations 
“Bottom-hat”operation usinga3 X 3structuring element; USe Supported by 
imbothat (see Section 9.6.2) for other structuring eljements. function bwmorph. 


>> JUt = makelut(econwaylLaws，3) j 
Connect Pixels Separated by single-PIXel gaps. 


Remove isolated foreground Pixels. 

Ciosing usinga3 X 3 structuring eljement; use imcl0se for other 
structuring eljements. 

Fill in around diagonalljy connected foreground pixels. 

Dilation usinga3 X 3 structuring element; use imdilate for 
other structurimng elements. 

Erosion usinga3 X 3 structuring elementi use imerode for other 
Structuring elements，. 

Fill in single-pixel“holes”(background pixels Surrounded by 
foregrotund pixels); use imfill (See Section 11.1.2) to hll in 
larger holes. 

Remove H-connected foreground pixejs. 

Make pixelp a foreground Pixel if at least five pixels in ANs(D) 
(see Section 9.4) are foreground pixels; otherwise make 忆 a 
background pixel， 

Opening usinga3 X 3 structuring element; se function imopen 
for other structuring elements. 

Remove “interior”Pixels (foreground pixels that have no 


Yarlous starting images have been devised to demonstrate the effect of ( 
way' s jaws on Successive generations (See Gardner, [1970, 1971]). Consid 
exampje, an Initial image caljed the“Cheshire cat configuration，” 了 


>> bw1 = [ 


刁 癌 避 避 己 口 口 听 口 诈 
但 口 怕 口 中 口 避 口 品 
已 号 听 一 一 一 品 口 口 品 
避 避 下 所 十 一 一 捷 品 
名 己 一 已 王 一口 口 呈 
全 号 瑟 一 口 吕 吕 
怠 呈 下 癌 口 二 一 口误 
已 一 雪 一 总 口 吕 局 
性 避 口 口 避 听 口 口误 
已 已 避 避 口 避 口 口 


The following commands perform the computation and display up to thet 
geneIation: 


>> jmshow(bw1，'n' )，title('Generation 1 ') 


>> bw2 = applIYJLUt( bwj，JLut) ; background neighbors). 

>> figure，imshow(bw2，':n');， title('Generation 2 ) Shrink objects with no holes to points; shrink objects with holes 
>>> bw3 = applylLut(bw2，1ut) ; to Tings. 

>> Tigure，inmnshow(bw3，'n'); title(' Generation 3 ) Skejletonize an lmapge， 


Remove SPUI Pixejs. 

Thicken objects without joining disconnected 1s. 

Thin objects without holes to minimaliy connected strokes; 
thin objects with holes to Tings. 


We leave lt as an exXercise to Show that after a few generations the cat fades to 
a“gIIin ”before finally leaving a“paw Print,” 


?4 Function bwmnorph tophat “Top-hat”operation using a 3 X 3 structuring element 
IPT function bwmorph implements a variety of useful operations based on cor imtophat (see Section 9.6.2) for other structuring 
elements. 


binations of dilations, erosions, and lookup table operations. Its calling syntax 


3 si 
AN 


Dorph = bwmor j . . ， 
0 本 Fig. 9.11(c) are fairly thick. It may be desirable for subsequent shape analySslS 
to thin the ridges so that each is one Pixel thlck. Each apPlication of bwmorph S 

inning operation removes one oOT two Pixels from the thickness of blinary 


image objects.The following commands,for example, display the resujts of ap- 


光 


where f ls an Input blinary image, operation is a String Specifying the desired 
operation, and n is a positive integer specifying the number of times the oper- 
ation is to be repeated. Input argument n is optional and can be omitted， 记 
which case the operation ls performed once. Table 9.3 describes the set of valid 
operations for bwmorph. In the rest of this section we concentiate on two 0 
these: 如 ining and SKeleloFiZatiom， 1 

7zmnning means reducing binary objects or Shapes in an image to strokes 
that are a single pixel wide. For example, the fingerprint ridges shown 也 


f= imread('fingerprint ClLeaned .tif ) 
> gt = bwmorph(fT， thin ，1) 

>> g2 = bwmorph(f， 'thin ，2) 1 

>> imshow(g1)，figure，imshow(g2 ) 
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FICURE 9.15 (a) Fingerprint image from Fig. 9.11(c) thinned once， (b) Image thinned twice, (cj Im 
thinned until stabijlity 


Bigures 9.19(a) and 9.15(b)， Tespectively Show the results. A key Question is Hg 
many times to apply the thinning Operatlon. For several operations, includ 认 
thimning, bwmorph allows n to be set to Infinity (Inf). Cajling bwmo rph with : 
Inf Instructs bwmorph to repeat the operation Until the image stops changi 
Sometimes this is cajled Iepeating an Operation zz 万 Siapilty. For example， 


， 三 I 二 。 - “ 软 
ai |GURE 9.16 (a) Bone image. (b) Skeleton obtained using function bwmorph. (c) Resulting skeleton atter 
ming with fanction endpoints， 天 站 
As Fig.9.15(c) shows, this is a significant improvement over Fig. 9.11(c)， 站 

3Keielomizatiom is another way to reduce binary image Objects to a set:0 人 
thin strokes that retain Important information about the Shapes of the originak 94 Labeling Connected Components 
objects. (Skeletonization is described in more detail in Gonzalez and Wood 
[2002].) Function bwmorph Performs skeletonization when operation is set 
 Skel .Iet f denote the image of the bonelike object ip Fig. 9.16(a).To cor 
Pute 1lts skeleton, we call bwmo rph, with n = Inf: 


he concepts discussed thus far are applicabie mostly to all foreground (or al 
kground) individual pixels and their immediate nelghbors. In this section 
consider the important“middle ground”between individual foreground 
Mels and the set of all foreground pixels.This leads to the notion of comzecied 
IDO1e11g, also referred to as opjecfs in the following discussion， 
:When asked to count the objects in Fig. 9.17(a), most people would identify 
nm: six characters and four simple geometric shapes. Figure 9.17(b) shows a 
mall rectangular section of Pixels in the image. How are the sixteen fore- 
&round plxels in Fig. 9.17(b) related to the ten objects in the lnage7 Although 
ey appear to be in two sepaiate gIOUPs, all sixteen Pixels actually belong to 
the letter“E” in Flg. 9.17(a), To develop computer programs that Iocate and 
operate on objects, we need a more Precise set of definitions for key terms， 

A pixel p at coordinates (x, y) has two horizontal and two vertical neigh- 
0IS Whose coordinates are (十 1] 四, (xz 一 1 人 门 ,(xy 十 1) and (xy 一 1)， 
This set of 4-7Peilghpory of p, denoted Ni(pj,is shaded in Fig. 9.18(aj.The four 
lagonal neighbors of D have coordinates (x 十 1， 少 十 下 )，(x 十 ly》 一 了 1)， 
-1Ly+l) and(z 一 1 一 1 Figure 9.18(bj shows these neighbors， 
Which are denoted Ap(p). TIhe union of NM(p) and Np(p) in Fig. 9.18(c) are 
he 8-meighpors of p, denoted Ne(D)， 

Two pixelsp ando are sajid to be 4-dcdjacen 让 0E Ni(P)， similarly， Danda 
Ie Said to be 8-adjacent 让 ge As(P)，Figures 9.18(d) and (e) illustrate 


>> fS = bwmorph(f，  Skel'，Inf)，; 
>> ImShow( 下 ) ， fizgure，imshow(fs) 


Figure 9.16(b) shows the Iesujlting skeleton, which is a reasonable likeness of 
the basic shape of the object、 

Skejletonization and thinning often produce Short extraneolus SPHIS, SOme- 
tmes called Parasitic CO1IDOHMe1H 帮 .The process of cleaning up (or removing): 
these spurs is caled pruning,. Function endpoints (Section 9.3.3) can be used: 
for tbis purpose. The method is to lteratively identify and remove endpolnts': 
The following simplje comrmands， for example，postprocesses the skeleton 
Image fs through five iterations of endpoint removals: 


= 1]1:5 
fS & ~endpoints(fsy) ; 


Figure 9.16(c) shows the result 
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FIGURE 9.17 
(a) Image 
containling ten 
objects.(b) A 
Sbset of Pixels 
代 om the 1 mage， 





FIGURE 9.18 (a) Pixel p and its 
4-nelghbors, A4(pP), (b) PixelP 
and its diagonal nelghbors， 
Aplp)j.(c) Pixelp and its 
8-neighbors, Nas(D). (d) Pixels 六 
aridd are 4-adjacent and 
8-adjacent, (e) PixelsPp and g are 
8-adjacent but not 4-adjacent. 
(f) The shaded pixels are both 
4-connected and 8-connected. 
(8) The shaded foreground 
pixels are 8-connected but not 
4-connected. 


these concepts, A_palp between Pixels Pi and Ph is a sequence of pixel 
.Di 1 pn Such that pk is adjacent to pk+l, for1 三 大 < 由 Apat 
can be 4-coHnpected oT 8-com1aected, depending on the definition of adjacenc 


P1，P2，.， 


Used. 


Two foreground pixels p and g are Said to be 4-connected 让 there exlst 
4-connected path between them，consisting entirely of foreground PIX 
[Fig. 9.18(f]. They are 8-connected if there exists an 8-connected path be 
tween them [Fig. 9.18(g)]. For any foreground Pixel, p, the set of all fore 
ground pixels connected to it is called the com7zected co1aPpomeHt containing 吕 
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Jbe term comnected comzpoHnemt was just defined in terms of a path, and the 
efinltion of a path in turn depends on adjacency. This Implles that the nature 
connected component depends on which form of adjacency we choose， 
tb 4- and 8-adjacency being the most common. Figure 9.19 illustrates the ef- 
that adjacency can have on determining the number of connected compo- 
nts in an image. Figure 9.19(a) shows a small binary image with ftour 
connected components. Figure 9.19(b) shows that choosing 8-adjacency re- 
&uces the nuUmber of connected components to two. 

JPT function bwlabel computes all the connected components in a binary 
age. The calling syntax is 
[L，num] = bwjlabel(f，conn) 

nerefis an input binary 1 image and conn Specifies the desired connectivity 
ither 4or 8). Output L ls called a iapel aatrix,and num (optional) glves the 
] number of connected components found. 区 parameter conn is omltted， 
alue defaujts to 8. Figure 9.19(c) shows the label matrix corresponding to 
,9， 19(a)j, computed using bwlLlabel(f, 4).The Pilxels in each different con- 
hected component are assjgned a unique integer, from ] to the total number 
connected components. In other words, the Pixels iabeled 1 belong to the 
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FIGURE 9.19 
Connected 
coImponents 

(aj Four 
4-connected 
coIDpPonents， 

(bj) Two 
8-connected 
components. 

(c) Label matIix 
obtained using 
4-connectivity 
(d) Labeil matrix 
obtained using 
8-connectivity. 
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EXAMPLE 9.7: 
Computing and 
displaying the 
center of mass of 
connected 
comPonents. 





LANQvector 
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PIEtT VEltes 7ne 
YY7GKE meanfA， 
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FIGURE 9.20 Centers 
of mass (white 
asterisks) shown 
Superimposed on 
thelr corresponding 


brst connected component; the pixels labeled 2 belong to the second con 
ed component; and so on. Background Pixels are labeled 0. Figure 9 
Shows the label matrix coOrresponding to Fig， 9.19(a)， computed :; 
bwlabel(f，8). 





























connected 
. ts. 
性 This example shows how to compute and display the center of mass of eache ee 
connected component in Fig. 9.17(a). First, we use bwlabel to compnute 专 
connected components: 
>> 二 = imread('objects,tif')， 
>> [|L，Dnj = bwlabel(f)， 
Function find (Section 5.2.2) is useful when working with label matrices- 
exampje, the following call to find returns the row and column indices for 
the pixels belonging to the third objecf: 
>> [FF，CcC] = find(L == 3) ; 
Function mean with rand c as inputs then computes the center of mass of 攻 
RE defined at coordinates (2,2). 
三 fgils the mask and 太 is the marker,the reconstruction of g from 记 denoted 
>> roar = mean(r)i j Ined by the following iterative procedure: 
>> Cbar = mean(c)， j 帮 ,is defined by 8 P 
Jnitiallize ja to be the marker imapge 矿 
A loop can be used to compnute and display the centers of mass of all the qi Create the structuring element:B = ones(3)， 
jects in the image. To make the centers of mass visible when superimposeda ，Repeat 
了 是 6 六 逢 _ 且 
the Image, we display them using a white marker on top of a black fjie Ai = (太田 BJmg 
clrcle marker, as follows: 
untj] 7 一 慷 : 
>> imshow( 咎 ) ， 
>> hold on 和 So later plotting commands plot on top of the image Marker 三 must be a subset of g; that is， 
>> for K = 13:n ES 
[r，c] = find(L == K); 
rbar = meantr)， “Figure 9.21 illustrates the preceding iterative procedure. Note that，al- 
cbanr = mean(c) ; hough this iterative formulation is useful conceptually much faster comptuta- 
plot(cbar，rbar，'Marker'，:01， MarkerEdgeColor '，'K'，.，. ional algorithms exist, IPT function imreconstruct Uses the “fast hybrid 
WarkerFaceColor '，"k'，'WMarkerSize'，10) econstruction” algorithm described in Vincent [1993]. The calling syntax for 
pJot(cbar，rbar，'Marker' ， 5x 1 ， WarkerEdgeColor' ，'W' ) mreconstruct is 
end 0 SS 
out = imreconstruct (marker，mask ) peconstruct 
Figure 9.20 shows the result. 3 
here marker and mask are as defined at the beginning of this section. 
小 】 Opening by Reconstruction See Sections 70.42 
， ， 。 。 ad 1043 了 raddi- 
Structuring eljement (instead of a single Image and Structuring element). One morphologlcal opening, erosion typically removes small objects, and the 0 





Subsequent dilation tends to restore the shape of the objects that remain. oFmaorphological 


Image,， the marker is the Starting point for the transformation. The other ee 
However the accuracy of this restoration depends on the Similarity between recomnsrrttctiom。 


Image, the maisK, constrains the transformation The structuring element used 
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conents Or broken connection paths. Jhere is no Poir 全 中 
npastthe level of detail required to identify those - 志 
Segmentation of nontrivial imasgcs js one of the moO; 和 下 
导 processing. Segmentation accuracy dctermincs the ev 办 
ATeomptterized analysis Proccdures. For this reason， 入 
号 Dken to improve thc probabiiity of ruggcd Segment FIGURE 9.22 
hasindustrial inspection applications,al least Some 人 
Deenvironmaentis possihpje at times, The experiencced Morphoio gcaj 
二 esianeT invatriahlv Pavs considerable atteniiop to Suc reconsStruct1on: 
es (a) Origlinal 


image. (D) ErIoded 
with vertical line. 
(c) Opened with a 
和 vertical line. 
p 1 ppl (d) Opened by 
Lp ptt mh pP reconstruction 
_Dj PP 二 with a vertical 
line.(e) 也 oles 
filled. 
(f Characters 
touching the 
border (See Tight 
border)， 





Pencnts @T Brekcn CerincectigSr Pazths. There js ne@ 介 帮 ! 
lien Past thc jevcl @i gctaii rcaUirel {@ jalcritify thesc 
Segmentatien ef nenirivial imagcs is enc @f thc mg 
咱 recessin 史 . Segmentatien acculacy &ctcrmines the ev 
十 时 cemguterizcg anaiysis 首 [@CCUTCS. 上 erT this TcaSSD,( 
bc iaken teinpreve the Preeaeility siru2gcadscgnmcnt 
such as industrial insPcctien za 玫 icatienpy at |east SSDIC 





te envirenment is Possible altimecs. The cxpericnccal (g) Border 

devisner invariahly Pays conpsiderable attentien te SUC 

Ca characters 
ponents or broken conncction Paths._Ihere is no poi TeImnoved . 


tion past tbe level of detail reduired to identify those 
Segmentation of nontrivial images is one of the mo 
Processing. Segmentation acecuracy detcrmines the ev 
.of compittterized analysis procedures. For tpis reason， 
be taken to improve the probability of rugged segment 
such as industrial inspectuon appliieauons at ieast some 
te environment is possible attimes The experienced 
desipper invariably pitys consideraple acention to SUC 





三 癌 和 
PIGURE 9.21 Morphological reconstruction. (a) Original image (the mask). (b) Marker image. (C)-(e) Inte 
mediate result after 100, 200, and 300 iterations, respectively. ( 纪 Final result, [TIhe outlines of the object 这 
the mask image are Superimposed on (b)-(e) as visual references.] 二 


投 


fo = imopen(f，ones(51，1) ) 


te that the vertical strokes were restored, but not the rest of the characteTs 


the shapes and the structufing element.The method discussed in this secti 机 ， 
ontaining the strokes. Finally, we obtalin the reconstruction: 


OpeHing py recoHstructfiom, restores exactly the Shapes of the objects that Te 
main after erosion. The opening by reconstruction of F, using structuring e 
ment B ,is defined as Rr(FeB)， > fobr = imreconstruct(fTe， 了 ) 
VDhe result in Fig. 9.22(d) shows that characteTs containing long vertical Strokes 

Were Testored exactly; all other characters were removed. The remaining Parts 


of Fig. 9.22 are explained in the following two sections. 


已 XAMPLE 9.8: 时 A _ comparison between opening and opening by reconstruction for 旺 
ea Image containing text is Shown in Fig. 9.22. In this example, we are interesteG 
in extracting from Fig. 9.22(a) the characters that contain long vertical StTOKeS< 

Since opening by reconstruction requires an eroded image, we perftorm that 

step first, using a thin, vertical structuring element of length proportional to 

the height of the characteTrs: 


Filling Holes 


Lorphologicaj reconstruction has a broad spectrum of practical applicatlons, 
Kach determined by the selection of the marker and mask images. For cxXam- 
ie suppose that we choose the marker image， 广 ,to be 0everywhere exXcept 
on the image border, where it is setto 工 一 广 


-Ac 这 (xisonthe border of /了 
jn 攻 一 0 otherwise 


>> 千 = imread('book text bw.tif ) ; 
>> fe = imerode(f，ones(51，1)); 


] 


Figure 9.22(b) shows the result, Tbhe opening, Shown in Fig. 9.22(c),1S comput: 
ed using imopen: 
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g the structuring element abonut its origin and translating it to all loca- 


= [Ri iing th 1 
ce 8 一 LeCjm] Das the effect of filling the holes in 户 as 和 the Image,just as the convolution kernel is rotated and then translated 


Hg ?9.22(e). IPT function imfil11 performs this computation automati 
when the optional argument 'holes' is used: 


g = infilli(f，'holes ') 





This function is discussed in more detail in Section 11.1.2. 


?.3.3 Clearing Border Objects 


Another useful application of reconstruction js removing objects that tou 计 
the border of an image.Again, the key task is to select the appropriate ma 
and Imask images to achieve the desired effect. In this case, we use the oO 
Image as the mask, and the marker image, 六 is defined as 


_ jzy) 让 (2y)isontheborderof 
7 多 一 10 otherwise 





ww would look likea digital “surface”with the height at any pair of coordi- 
being given by the value of b at those coordinates. 

practice, gray-scale dilation usually is performed using jar structuring el- 
ts (See Jable 9.2) in which the value (height) of bis 0 at all coordinates 


which Di is defined.That is， 
Figure 9.22( 们 shows that the reconstruction ， Rr( 岂 ,contains only the obj 这 家 
touching the border.The set difference 太一 rshown in Fig.9.22(g) xy 一 0 for(z 7)s 疡 
tains only the objects from the original image that do not touch the bo d。 ithis case, the max operation is specified completely by the pattern of 0s and 
IPT function imclearborder Performs this entire procedure automatic: inary Inatrix Z, and the gray-scale dilation equation simpjlifies to 


Its syntax is UBDzy) = max{fFrz 一 xy (xye) 


, fat gray-scale dilation js a local-maximum operator, where the maxi- 
S taken ovVer a Set of Pixel neighbors determined by the shape of 刀 . 
onflat structuring elements are created with strel by passing it two ma- 
S: (1T) a matrix of 0s and 1s Specifying the structuring element domain 疙 )， 
2) a second matrix specifying height values， px ,7 ). For example， 


AN 


Caearborder g = imclearborder(f，conn) 


内 ， 





where f is the input image and g is the result.The value of conn can be el 
4or 8 (the default). This function suppresses structures that are lighter th 
thelr surroundings and that are connected to the Image border. Input f canb 
a 8gTay-scaje or binary image.The output image isa gray-Scale of bljnary image 


TeSpectively. = Strel([11 1]，[1 2 1]) 


Gray-Scale Morphology 





Allthe binary morphological operations discussed in this chapter, with the 


ception of the hitror-miss transform, have natural extensions to 8Iay-Scale im 1 1 1 
ages. In this section, as in the binary case, we start with dilation and erosic 鹿 Height: 
which for gray-scale images are defined in terms of minima and Iaaxima ol 1 2 


Pixel neighborhoods. 
cates a 1 X 3 structuring element whose height values are 思 (0， 一 上 ) = 十 ， 
小 0) = 2, and (0,1) = 1 

lat structuring elements for gray-scale images are created using strel in 
Uie same way as for binary images, For example, the following commands 
iow how to dilate the Image f in Fig, 9.23(a) using a fat 3 X 3 structuring 
erIment: 


y.5.| Dilation and Erosion 









The Sray-scale diation of Aby structurling element b, denoted 上 田间 is d 
fined as 


Ue@btD) = maxfjz 一 cy 一 YY)+bcyJlcy)sD 


where Do is the domain of p, and F(x, y) is assumed to eqtal 一 co outside tf 
domain of This equation implements a PITocess SImilar to the concept of Sp 


.Se = Strel('square'，3); 
flal convolution，explained in Section 3.4.1 Conceptuaily，we can think 坟 


imdilate(f，se); 


( 避 
心 . 
及 
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ab 

C 

FIGURE 9.23 
Dilation and 
eTOsl0n. 

(a) Original 
Image.(b) Dilated 
Image.(c) Eroded 
Image， 

(d) Morphological 
gradient. 
(OrTliginal image 
CoOUrtesy of 
NASA.) 













































汪汪 0 Te 9.23(c) Shows the result of using imerode with the Same structuring el- 
7 人 和 ent used for Flg. 2.23(bD) 


= imerode(Tf，Se); 
jlation and erosion can be combined to achieve a variety of efftects. For in- 
tance, Subtracting an erfoded image from lts dllated VerTslon prfoduces a “Imor- 
opgical gradient”which is a measure of local gray-level variation in the 


apge. For example, letting 
orph_grad = imsubtract(gd，9ge) ; 


ced the image in Fig. 9.23(d), which is the morphological gradlent of the 
ge iD Fig. 9.23(3). This image has edge- -enhancement characteristics Simjlar 
10se that would be obtained using the gradient opeTatlons discussed In 
Ions 6.0.1 and later in Section 10.1.3. 


Opening and Closing 


Xpressions for opening and closing gray-Scale images have the Same form 
heir binary counterparts. The opening of image 厂 by structuring element D， 
enoted F。p, is defined as 


太 ob =(FeD)@5 


before, this is simply the erosion of 太 by b, followed by the dilation of the 
tby p.Similarly,the closing of Aby D,denoted 矿 .5,is dilation followed by 


Oil， 
让. = (F@D)6D 


h operations have Simple geometric interpretations.Suppose that an Image 
ction F(x, y) is viewed as a 3-D surface; that jls, its intenslty values are In- 
rpreted 3aS height values Over the Yy-plane. Then the opening of 六 by 2 can 
einterpreted geometrically as Pushing structuring element up agalnst the 
Mderside of the surface and translating it across the entire domain of 矿 The 
ning is constructed by finding the highest points reached by any Part of the 
cturing element as it slides against the undersurface of / 

Figure 9.24 ilustrates the concept in one dimension. Consider the curve in 
8&. 9.24(a) to be the values along a single row of an image. Figure 9.24(b) 
ows a flat structuring element in several positions, pushed up against the 
ottom of the curve. The complete opening is Shown as the cuUrve along the 
p of the shaded region in 丽 g. 9.24(c), Since the structuring element ls too 
e to fit inside the upward peak on the middje of the curve, that peak ls re- 
ved by the opening. In general, openings are Used to remove Small bright 
details while leaving the overall gray levejls and larger bright features reja- 
Myely undisturbed， 

igure 9.24(d) provides a graphical illustration of closing, Note that the 
sructuring element is pushed down on top of the curve while being translated 


Hgure 9.23(b) shows the restlt,As expected, the image is slightly blurred: he 
rest of thls fgure is explained in the following discussion， 

The 8ray-scale erosiom of 广 by structuring element D, denoted 厂 提 pb, is 
fned as 


USep 基 二 mintfz+x yy+y) 一 pz) zy)s 旋 ] 


where Do ls the domain of p and Fx, y) is assumed to be +oco outsjide thed 
main of 刻 Conceptually, we again can think of translating the structuring ele 
ment to all locations in the image. At each translated location, the structuring& 
eljement values are Subtracted from the imapge Pixel values and the minimum 
1 taken. 

As with dilation, gray-scale erosion is most often performed using flat strUC 
turing elements. The equation for Hat gray-scale erosion can then be Simplified 雪 


US9btzy)=minffz+x yy+y) xy)sD] 


Thus, at gray-scale erosion is a local-minimum operator, in which the IIDI 
mum is taken over a Set of pixel neighbors determined by the shape of 已 
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CopzPuERS IFe mmOr- 
PPoiogical 8radiemti 
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FIGURE 9.24 
Opening and 
closing in one 
dimension. 

(alj Original 1-D 
Signal.(b) Flat 
structurlng 
element Pushed 
up underneath 
the signal. 

(c) OPening. 
(d) Fiat 
Structurlng 
element pushed 
down along the 


top of the Signal. 


(e) Closing， 


正义 AMPLE 9.9: 
Morphological 
smoothing Using 
openings and 
closings. 
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imread( :plugs.,jpg ) ; 
= Strel( disk ，5) ; 

= imopen(f，Se); 

= lmclose(fo，Se); 


18ure 9.25(b) Shows the opened image, fo, and Fig. 9.25(c) shows the closing 
the opening, foc. Note the smoothing of the background and of the details 
the objects. This procedure is often called open-close iiering Close-opem 
err718 produces Simijlar results， 
nother way to use Openings and closings in combination is in Qitier7ating 
ential jlering. One form of alternating sequential filtering is to perform 
nclose filtering with a series of structuring elements of increasing Slze. The 
ljowing commands illustrate this process, which begins with a small structur- 
element and increases its size Untilit is the same as the structuring element 


to ai Jocations. The ciosing,shown in Fig. 9.24(ej,is constructed by finding 世 be dto obtain Figs. 9.25(b) and (c): 


lowest points reached by any part of the structuiing element as it Slides agains 
the UPPer side of the curve. Here, we see that closing SUppresses dark deta 芷 
Smalljer than the structuring element. 


fasf = T; 
for k = 2:5 

Se = Strel( disk' ，Kk)，; 

fasf = imclose(imopen(fasf，se)，se); 
end 


风 Because opening suppresses bright details smaller than the structuring el 
ment, and closing Suppresses dark detaijls smaller than the structuring elemenft 
they are used often in combination for image smoothing and noise remoyval: 焉 
this example we use imopen and imclose to smooth the image of wood dowej 
Plugs shown in Fig. 9.25(a): 


ae TeSujt, Shown in Fig. 9.25(d), yielded sjightiy smoother results than Using 3a 
ngle open-close filter, at the expense of additional PITocessing， 瑞 
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4 志 

FIGURE 9.25 
Smoothing using 
openings and 
closlings. 

(aj Original imase 
of wood dowel 
plugs. (b) Image 
opened using 
disk of radius 5. 
(c) Closing of the 
opening， 

(d) Alternating 
Sequential hjter 
Tesujt. 
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:f2 = imsSsubtract(f，Tfo) 


ibure 9.26(d) shows the result, and Fig. 9.26(e) shows the new thresholded 
*oe.The improvement is apParent. 


Subtracting an opened image from the original is called a iop-Apat transfor- 
iiion. IPT function imtophat performs thls Operation in asSingle step: 


导 


2 = imtophat(f，Se) ; 


Wiaction imtophat can also be called asg= imtophat(f, NHOOD),where NHOUD 
an array of 0s and ls that specifies the size and shape of the structurling eje- 
ient.This SYyntax 1$ the Same as using the call imtophat (ff，strel(NHO0D) 

related function, imbothat, performs 3a pottozaz-pat transformation, de- 
ned as the closing of the image minus the image. [Its Syntax is the Same as for 
inction imtophat, These two functions can be Used together for contrast en- 
incement using commands Such as 


-Se = strel('disk' ，3); 
> gg = imsubtract(imadd(f，imtophat(f，Se))， imbothat (ff ，5Se))j; 网 


Techniques for determining the size distribution of particles in an Image 
an important Part of the field of granuiommaetry, Morphological techniques 
be used to measure particle size distribution ipdirecty; that is withonut 
dentifying explicitly and measuring every particle. For particles with regular 
hapes that are lighter than the background, the baslc approach is to appjy 
norphological openings of increasing size. For each opening,the Sum of all tbe 
Kel vaiues in the opening 18 computed; this sum sometimes is caljed the 
WUrface area of the image. The following commands apply disg-shaped open- 
ipgs with radii0 to 35 to the image in Fig.9.25(0a): 


二 be 
中 
FIGURE 9.26 Top-hat transformation. (a) Original image. (b) Thresholded image. (c) Opened image. (d) TIo 民 

hat transformation. (e) Threshojded top-hat image. (Original image courtesy of The Math Works， Inc.) 


EXAMPLE 9.10: ” 轩 Openings can be used to compensate for nonuniform background illumrin 
Using the tophat tion, Figure 9.26(a) shows an image, f, of rice grains in Which the background 
transformation， is darker towards the bottom than in the upper portion of the image.The un: 
even ilumination makes _ image thresholding (Section 10.3) ditficuj 
Figure 9.26(bj,for example,is a thresholded version in which grains at thet 
of the image are well separated from the background, but gralins alt the botto 芋 
are improperly extracted from the background. Opening the image can pI0: 
duce a reasonabjle estimate of the background across the image, as jong 3S 外 
structuring element is large enough so that it does not fit entirely within 世 : 
rice grains. For example, the commands 


f = imread( pl1Lugs ,jpg )) 
Sumpixels = zeros(1，36) 
0:35 

Se strel('disk ，kK) 
fo imopen(f，sSe); 
Sumpixels(k + 1) = 


SAY 
- 
心 
本 
大 
所 


计 


Sumt(fo( :)) 
end 
> plot (0:35， sumpixels)，Xxlabel('k')，yiabel( Surface area ' ) 


lgure 9.27(a) shows the resujting plot of SumpiXe1S VerSUS k. More interest- 
strel( disk ，10) gjis the reduction in surface area between SuccessiVe Openlings: 


imopen(fT，Se) 


和 


2> Se 
>> fo 


“plLot(-diff(SumpziIxelSs) ) 
XLabel(k' ) 


resulted in the opened image in Fig.9.26(c).By subtracting tbls 1mage froma tb 
ylabel('Surface area reduction ) 


original image, we can produce an image of the grains with a reasonably eye 
background: 
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2 X 107 汗 
总 3 2.5 
FIGURE 9.27 3 3 间 
Granujometry 号 HGCURE 9.28 Gray- 
Sailrf D 2.3 

(a) Surface area 一 号 15 scale morphologicalj 
VerSUS Structuring 3 ? oq reconstruction in 
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We Another useful gray-scale Teconstruction technique is openizag-by- 
jeco1nstyructiom in which an image is first eroded, just as in standard morpho- 


logcal opening. However instead of following the opening by a closing, the 
sioded image is used as the marker image in a reconstruction, The Original 
mage 1$ used as the mask. Figure 9.29(a) shows an example of opening-by- 


Peaks in the plot in REig. 9.27(b) indicate the presence of a large number ofo 
construction, obtained using the comimands 


Jects having that radius. Since the plot is quite noisy we repeat this proced 


f= imread('plugs,.jpg'); 

Se = Strelf( disk'，5) |; 

Te = Imerode( 于 ，Sse); 

,fobr = imreconstruct(fe，f); 


original image. 


3.8.3 入 econstruction 


Gray-scale morphological reconstruction is defined by the same iterative PT 
cedure given in Section 9.5. Figure 9.28 shows how reconstruction Works 
one dimension, The top curve of Fipg. 9.28(aj is the mask while the bottom ， gF 纪 


Reconstruction can be used to clean up lmage fobr further by applying to 
a technique called closizzg-Dy-recomstructio7z. Closing-by-reconstruction is 


cuIVe 1 he marker In this case the markeris formed by subtracting a constant 二 

trom the mask, but in general any Signal can be used for the marker as long 到 FIGURE 9.29 

none of its values exceed the Corresponding value in the mask. Each iteratiof (a) Opening-by- 

of the reconstruction procedure Spreads the peaks in the marker curve un 人 志 reconstruction， 

they are forced downward by the mask curve [Fig.9.28(b)] 下 

. 、 . reconstructlon 

Jhe final reconstruction is the black curve In Fig. 9.28(c). Notice that 弓 folowed by 

two Smajller peaks were eliminated in the reconstruction，bnuit the two talle ciosing-by- 


peaks, although they are now shorter, remain. When a marker image is formae Techstet 下 
by subtracting a constant 疡 from the mask Image, the reconstruction is called， Ne 
the A-7atpipaa tansjorraz.The h-minima transform, computed by IPT functio 


2 min imhmin, js used to suppress small peaks， 
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Implemented by complementing an image，computing its opening 5 本 人 
reconstruction, and then complementing the result, The steps are as foll 于 


| 


>> fobrc = imcompIement{(fobr) ; ”和 机 | 吕 | 
>> fobrce = imerode(fobrc，se); 区 
>> fobrcbr = imcompliement(imreconstruct(fobrce，fobrc) ) ; 一 ER | 


SOKVER 上 4 MATRMX 


Figure 9.29(b) shows the result of opening-by-reconstruction folowed SS 人 
closing-by- reconstructton. Compare it with the open-close filter and alte 
Ing Sequential filter resualts in Fig. 9.25. 有 所 1 人 


EXAMPLE 912: 机 Our concluding example uses gray-scale reconstruction in several steps. 
Usng objective is to isolate the text out of the image of calculator keys show 
TeconsTHcHon [0 Fig,9.30(a).The first step is to suppress the horizontal reflections along theto 
Temove a COImP1lex 5 
image each key Jo accomplish this, we take advantage of the fact that these reflectio 
background， are wider than any single text character in the image. We perform opening 上 

Tecollstruction using a structuring element that is a long horizontal line: 


LOGi LN XEG 


于 二 号 考 内 - 具 呈 


SI 《COSTRN 


STG 


在 寺 看 


已 有 于 攻 知 


帮 1CCES 人 有 中 | 

XX2y: | 已 : 哄 | 

办 引 RI 所 和 T 且 以 攻 LL 如 在 闪 
3 9 

名 有 村 所 入 节 全 


汪 


>>T= imread(' calculator.jpg')， 
>> f_obr = imreconstruct(imnerode(f，ones(1，71))，f); 
>> To= inopen(f，ones(1，71));， 55 For comparison， 


本 


5 





The opening-by-Teconstruction (fT_obr) is shown in Fig. 9.30(b). For compai 
Son, Fig. 9.30(c) shows the standard opening (f_o).OPpening-by-reconstruc 
did a better job of extracting the background between horizontally adjac 
keys. Subtracting the opening-by-reconstruction 人 from the original 8 
called iopPpat-py-recomstructiom and is shown in Fig. 9.30(d): 


STO 
ER 

>> f_thr = imsubtract(f，Tf obr) ; 

>> f_th = imsubtract(f，f o); %5Or imtophat( 于 ，ones(1，71)): 





Figure 9.30(e) shows the standard top-hat computation (i.e.,f_th). 

Next, we SUpPress the vertical reflections on the right edges of the keysi 
也 5. 9.300d). This is done by performing opening-by-reconstruction with: 
small horizontaj line: 





>> g_obr = imreconstruct(imerode(f_thr，ones(1，11))，f_thr); FIGURE 9.30 An application of gray-scale reconstruction. (a) Original image. (b) Opening-by-reconstruction， 
cj Opening. (d) Tophat-by-Teconstruction. (e) Tophat, (人 Opening-by-reconstructlon of (dj using a horizontal 


lne 人 8) Diation of 区 using a horizontal line, (by) Final reconstruction ITesult， 





In the result [Fig. 9.30(],the vertical reflections are gone, but so are thin-ve 
tlcal-Stroke characters, Such as the Slash on the percent symbol and the“T 1 
ANIN,. We take advantage of the fact that the characters that have been SU 
Pressed in eITOT are VeIy close to other characters still present by first Pel 
forming a dilation [Fig. 9.30(g) IJhe morphological concepts and techniques introduced in this chapter constitute a 
bowerful set of tools for extracting features from an image. The basic operators of ero- 
ion, dilation, and reconstruction 一 defined for both binary and gray-scale image PIO- 
essing 一 can be used in combination to perform a wide variety of tasks. As Shown in 
the following chapter, morphological techniques can be used for 1mage Segmentation， 
Moreover, they play a major role in algorithms for image description, as discussed Im 


Chapter 11， 


>> g_ obrd = inmdilate(g_ obr，ones(1，21)); 


followed by a final reconstruction with f_thr as the mask and min(g_obrd 
f_thr) as the markerT: 


>> fT2 = imreconstruct(min(g_obrd，f_thpr)，Tf thr); 


Pigure 9.30(h) shows the final result. Note that the shading and reflections 0 
the background and keys were ITemoved successfully. 
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on of color Images are discussed in Section 0.6). We begin the develiopment 
ip methods sultable tor detecting intenslty discontinuities Such as points， 
ies, and edges. 了 dge detection in particular has been a staple of segmentation 
porithms for many years. In addition to edge detection per se, we also discusSS 
Cting Hinear edge Segments using methods based on the 豆 oxghn rrans 加 rza. 
:discusslon of edge detection ls folljowed by the introduction to threshold- 
人 Thresholdimng alsolis a fundamental approach to segmentatlon 


De edi is an Important factor. The discussion on thresholding ls followed by the 
opment of Tegion-oriented segmentation approaches. We conclude the 
iapter with a dliscussion of a morphological approach to Segmentation called 
atersjned Segaeratatior. This apProach is particularly attractive because it Pro- 







Peter 


enslty discontinuities in a digital image: points， lines, and edges. The most 
mmon way to look for discontinuities is to run a mask through the image in 
Imanner descrlbed in Sections 3.4 and 3.9. Fora3 X 3mask this procedure 
/olVesS computing the Sum of products of the coefficients with the intensity 
els contained in the region encompassed by the mask. That is, the response， 
of te mask at any point in the image js given by 















The material in the Previous chapter began a transition from image Processin 各 
methods whose inputs and outputs are images to methods in which the inpt 
are Images, bnut the outputs are attributes extracted 位 om those images. Se 
mentation js another major step in that direction. - 
segmentation Subdivides an image into its constituent regions or Object 
The level to which the subdivision is carried depends on the problem being 
Sojved. That is, segmentation should stop when the objects of interest in anf 
Phcation have been isolated. For example,in the automated inspection of elek 
tronlc asSemblies, interest jlies in analyzing images of the products with 也 
objective of determining the presence or absence of specific anomalies, suct 
mlssing components Or broken connection paths, There is no point in carryin 允 
Segmentation past the level of detail required to identify those elements. 
segmentation of nontrivial images is one of the most difficult tasks in imag 
processing. Segmentation accuracy determines the eventual success or failure 
computerltzed anajlysis procedures. For this reason, considerable care shouldhb 
taken to Improve the probability of rugged segmentation. fn some situatio 
Such as industrial inspection applications, at least Some measure of control ove 
the environment is possible at times. In others, as in remote Sensing， User contrg 
over Image acquisition is limited Principally to the choice of imaging Sensors. 泛 
Segmentation algorithms for monochrome images generally are based 0 
one of two basic Properties of image intensity Values: discontinuity and simi 
larity. In the first category, the approach is to partition an image based 0| 罗 
abrupt changes in intensity such as edges in an image.The principal approache ， 
es in the second category are based on partitioning an image into regions tb 
are Similar according to aset of predefined criteria. 
In thls chapter we discuss a number of approaches in the two categories juS 
imentioned as they apply to monochrome images (edge detection and Segmet 


作 三 ?2071 十 2272 十 十 20929 
9 
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Where zi is the intensity of the pixel associated with mask coefficient .Asbe- 
ure, the response of the mask is defined with respect to its center. 


]】 Point Detection 
e Re of lsolated points embedded i In areas of constant oOT nearly con- 
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FlICURE 10.1 
A mask for point 
detect1on. 
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上 XARNMDPELE I0.1: 
Point detectlon， 


az 

FICURLE 10.2 

(aj) Gray-scale 
Image with a 
nearly invisible 
Isolated black 
point in the dark 
gray 3aIea of the 
Dortheast 
quadfant， 

(b) Image 
Showing the 
detected Polnt. 
(The point was 
enjarged to make 
tt easier to See.) 










where 7 ls a nonnegative threshold. Point detection is implemented in 
LAB using function imfilter，with the mask in Fig. 10.1, or other Si 
mask, Ihe Important requirements are that the strongest response of a 下头 人 
must be when the mask is centered on an isolated point, and that the resw 
be0in areas of constant intensity. 

HTils given, the folowing command implements the point-detection 
PIToach just discussed: 


ngTtobethe maximum value in the filtered image, g,and then find- 





selecti 
































ted using the >= Operator for consistency lin notation. Since T Was Se- 
din this case to be the maximum value in g,clearly there can be no Polnts 
with values greater than T.As Fig. 10.2(b) shows,there was a Single ljSolat- 
Joint that satisfied the condition g>=T withT setto max(g(:)). 





>> g = abs(imfilter(double(f)，w)) >= T; 
other approach to point detection is to find the points in all nelghbor- 
Sods of size mm X mn for which the difference of the maximum and minlmum 
6ls values exceeds a specified value of T. This appProach can be Implement- 
dusing function ordfilt2 introduced in Section 3.2.2: 


where f is the input image, wis an appropriate point-detection mask [e.gssthe 
mask in Fig. 10.1j, and g is the resulting image. Recall 人 om the discussiom 
5ection 3.4.1 that imfilLter converts its output to the class of the IDput, s Y 
use double(f) in the filtering operation to prevent premature truncation 
values ii the imput is of class uint8, and becatuse the abs OpeTation does 
accept integer data.The output image gis of class 1ogical:its values are0 
1. 下 TIN not given, its value often is chosen based on the filtered resul 和 
which case the previous command string is broken down into three basic stenR 
(也 Compnute the filtered image, abs(imfilter(double(f)，w) )， (2) findit 
value for T using the data from the filtered image,and (3) compare the filteredg 
Image against T. This apProach is illustrated in the following example， 运 


= imsubtfPact(ordfilt2(f，mx*n，ones(m，n))，,.. 
ordfilt2(f，+，ones(m，n))) ; 


2 


:10.2(b).The preceding formulation is more flexible than using the mask in 
0.1. For example, 让 we wanted to compnute the difference between the 
st and the next highest pixel value in a neighborhood, we would replace 
on the far right of the preceding expression by msn 一 1.Other variationas 
his basic theme are formulated in a similar mmanneT 





型 Figure 10.2(a) shows an image with a nearly invisible black point in: he 
dark gray area of the northeast quadrant. Letting f denote this image, we fint 
the location of the point as follows: 
Line Detection 


人 next level of complexity is line detection, Consider the masks in Fig. 10.3. 


| Re We e first mask were moved around an image,it would respond more Strong- 
>> o lines (one pixel thick) oriented horizontally. With a constant background， 
>> in 和 be maximum response would result when the line passed through the mlddije 


of the mask. Similarly, the second mask in Fig. 10.3 responds best to jines 
nted at 二 4$", the third mask to vertical lines; and the fourth mask to lines 
he -4$。 direction. Note that the Preferred direction of each mask 18 





Vertical 





瓦 ofmizontal 
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FIGURE 10.3 Line 
detector masks. 
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于 入 AMPLE 10.2: 
Detection of lines 
im a specified 
direction. 
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Let Ri R2, Ra, and R4 denote the responses of the masks in Fig. 10.3,f 
leftt to right, where the Rs are given by the equation in the Prevlous Sectig 
uppose that the four masks are run individually through an image.IE ataes 
tain point in the image, | 有 | > |RRi for al 关 六 that point is said to be mo 世 
jikely associated with aline in the direction of maski For exXample,ifata p 
in the image,|R| > Ri fory = 2,3,4,that particular point is said to be m 
jhkely associated with a horizontal line. Alternatively we may be intereste 
detecting lines in a specified direction. In this case, we woujld use the mask: 
Sociated with that direction and threshold its output, as in the equation in 侍 
PIevious Section. In other words, 让 we are interested in detecting all the lin 
in an image in the direction defined by a given mask, we simply run the mas 
through the image and threshold the absolute value of the result. The poi 
that are jett are the Strongest responses, which, for lines one Pixel thick, corf 
sSpond closest to the direction defined by the mask.The following examplei 
lustrates this Procedure. ， 
油 Figure 10.4(a) shows a digitized (bjnary) portion of a wire-bond mask 区 
an ejlectronic circuit. The image size is 486 X 486 pixels. Suppose that we 
Interested in finding all the lines that are one Pixel thick, oriented at 一 45。. 
thls Purpose, we Use the last mask in Fig.10.3. Figures 10.4(b) through ( 提 wer 
generated Using the following commands, where fis the Image in Fig, 10.4(a 


>>wW= [2-1-; -tt2-1; -tt -1 2]; 

>> 9g imfilter(double(f)，w);， 

>> Imshow(g，[ ]) 5 Fig， 10.4(b) 

>> Qtop = 9 120 2120)5 

>> gtop = pixeldup(gtop，4); 

>> figure，imshow(gtop，[ ]) 所 Fig， 10.4(c) 
>> gbot = g(end-119:end ， end~-119:end ) ; 

>> gbot plIxelIdup(gbot ，41) ; 

>> figure，imshow(gbot，[ ]) % Fig. 10.4(d) 
>>g= abs(g); 

>> figure，imshow(g，[ ]) s Fig， 10.4(e) 

>> T = max(g(:)); 
>>g9=g >= 了 T; 

>> fiLgure，jimshow(g) 





% Fig，10.4(f) 
The shades darker than the gray background in 卫 g. 10.4(b) correspond to neg 
tive vajlues. There are two main segments oriented in the --45。 direction, one- 
耻 e top,lett and one at the bottom, right [Figs. 10.4(c) and (d) show zoomed se 
tions of these two areas]. Note how much brighter the straight line segment 
Flg. 10.4(d) is than the segment in Fig. 10.4(c).The reason js that the comPponett 
Im the bottom,right of Fig. 10.4(a) is one pixel thick, while the one at the top,]e 姓 
1S not. The mask response is stronger for the one-plixel-thick component. 
Figure 10.4(e) shows the absolute value of Fig. 10.4(b). Since we are inte 
ested in the strongest response, we let T equal the maximum value in tbl; 
IDage. Figure 10.4( 绅 shows in white the points whose values satisfied the 
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:法 

FIGURE 10.4 

(aj Image of a 
wire-bond mask， 
(b) Result of 
processing with 
the 一 49” detector 
in Fig. 10.3. 

(c) Zoomed view 
of the top,lett 
reglon of (b). 

(d) Zoomed view 
of the bottom， 
Tlght section of 
(b).(e) Absolute 
yajlue of (b). 
(Al points (in 
whlte) whose 
values Satisfied 
the condition 

g >=T, where g is 
the image in (e). 
(he Points in (人 
werIe enjarged 
slightly to make 
them easier to 
See.) 
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_change in areas whose Pixel values are variabjle, It is common prTactice to Tefer 































econd-order derivatives In Image PTIOCcesSsling are generally computed using 
eLaplacian introduced In Section 3.5.1, That ls, the Laplaclan of a 2-D func- 
ion jz, y) is formed from Second-order derivatives, as follows: 

_ 人) ac 


Dx 87 


condition g>=T, where gis the image in Fig.10.4(ej.The isolated points in t 
figure are polnts that aliso had Similarly strong responses to the mask. In 
orIlglinal image, these points and thelir Immediate Delghbors are orlented 
Such a way thbat the mask produced a maximum response at those lsojlate 1 
cations, These lsolated points can be detected using the mask in Fig. 10.1 
then dejleted, or they could be deleted using morphological Operators, 38 
cussed jin the last chapter. 


VF(z, J 


he Laplacian is seldom used by itsejf for edge detection because, as a Second- 


10.1.3 Edge Detection Using Function edge 坚 0Tder derivative', it is Unacceptably Sensltlve to nolse lts mmagnlitude pIoduces 
Uuble edges, and it is unable to detect edge direction. HoOwever, as discussed 


Although point and line detection Certainjy are Imnportant In any discussion 0l rin this section,the Laplacian can be apowerful complement when used in 
lage Segmentatlon， edge detection 人 by far the most comImOD apPrIoach 所 mbination with other edge-detection techniques. For example, altbhough its 
detecting meaningftu] discontinultles 1 Intenslty Values Such dlscontinu double edges make it unsuitably for edge detection directly, this Property can 
are detected by using first- and second-order derivatives. TIhe first-order d used for edge 1ocatio1. 

ative of cholce In Inage PIOCSSSIDS 1S the gradient， defined in section 0.0.1. W :With the preceding discussion as background, the basic idea behind edge 
Tepeat tbe perbnent equatlons here for convenience. fbe gradiemr of a 2: ction is to find places in an image where the intensity changes rapidly， 
function, Fx, y), is defined as the vector ing one of twWo general criterTia: 


凶 Find places where the first derivative of the intensity is greater In0 magnl- 
Vf =- Cr | | 6x tude than a specifiied threshold. 
Cy 9 .Find Places where the second derivative of the intensity has a ZerO 
07y cTOSSing， 


PT's function edge provides several derivative estimators based on the crite- 
just discussed. For some of these estimators,it 1s possibje to Specify whether 
ftie edge detector is Sensitive to horizontal orT vertical edges or to both. The 
eneral syntax for this function 1S 


The magnlitude of thijs vector is 
1/2 
VF = mag(Vf = [G2 + G3] 
[(aHaz)2 + (ay 


the Square-Toot operation， Ig，t] = edge(f， method ，parameters ) 


VJ = Gx + Gy Where f is the input image，method 1S one of the approaches listed in 
able 10.1, and parameters are additional parameters explained in the fol- 
oOWing discussion. In the output, g is a logical array with ls at the locations 
ere edge points were detected in f and 0s elsewhere. Parameter t 18 optlon- 
llt gives the threshold used by edge to determine wbich gradient values are 


trong enough to be called edge points. 


or by using absolute values， 
Vs= |G: + |Gj 


These approximations still behave as derivatives; that js, they are zero in areas 
constant intensity and their values are Proportional to the degree of intengl 


obel Edge Detector 


he Sopel edge detector uses the masks in Fig, 10.S(b) to approximate digital- 
he first derivatives Crx and CIn other words, the gradient at the centerT 
oint in a neighborhood is computed as fojlows by the Sobel detector: 


the magnitude of the gradient or its apPIoximations simply as“the gradient.” 

Atfundamental property of the gradient vector is that it points in the direc: 
tion of the maximunm Tate of change of 太 at coordinates (x, y). The angle 3 
whblch this maximum rate of change oOcCcUIS is 


C 
一 f | 下 
ax,y) = tan 忆 


One of the key jssues is how to estimate the derivatives Cu and Cy digitally The 
Varflous approaches used by function edge are discussed later in this section， 


g8=[G+G3 
{[(z7 + 2z8 + z9) 一 (zl+2z + 2) 
+[(z3 二 2z6+29) 一 (zi+2z+z7)] 站 


172 


10.1 外 Point Line, and Edge Detection 


385 


386 Chapter 10 强 Image Segmentation 


JABLE 10.1 
Edge detectors 
avallable in 
function edge, 





卫 asic Properties 










Edge Detector 








Sobel Finds edges using the Sobel approximatio 
the derivatives Shown in Fig. 10.5(b)， 
Prewitt Finds edges using the Prewitt approximatier 
the derivatives shown in Fig. 10.5(c). 
Roberts Finds edges Using the Roberts approximati 
to the derivatives shown in Fig. 10.3(d). 
Lapiaclan of a Gaussian (LoG) ”Finds edges by looking for zero crossings af 
fitering Fx, y) with a Gaussian filter. 
Zero crossings Finds edges by looking for zero crossings af 
filtering xy) with a userspecified filter 
Finds edges by looking for local maxima of 针 
gradient of /xz, y). The gradient is calcuiated 
using the derivative of a Gaussian filter The: 
method uses two threshoids to detect Strong 
and weak edges, and includes the weak edges 
in the output only 这 they are connected to 
Strong edges. Therefore, this method is more- 
likely to detect true weak edges. 








9 

















Canny 





(2 
(十 2 十) (zl 十 2z24 十 27) 


Then, we say that a pixel at location (cz y) is an edge pixelifg 兰 Tatthat 
cation, where 7 is a specified threshold. 
From the discussion in Section 3.5.1, we know that Sobel edge detection 把 
be Implemented by filtering an image, f, (using imfilter) with the left mil 
in Fig. 10.5(b),filtering f again with the other mask,squaring the pixels val 
of each filtered image,adding the two results and computing their square To00 世 
5imilar comrments apply to the second and third entries in Table 10.1. Funectia 
868dge SImply packages the preceding operations into one function call an 
adds other features, such as accepting 3a threshold value or determinin 
threshold automatically, In addition, edge contains edge detection technique 
that are not implementable directly with imfilter. 
The general calling syntax for the Sobel detector is 


Prewitt 





Ge 本 二 二 ET 人 二 二 二 二 
(zl 十 Z2 十 23) 【未 二 丽 十 友 ) 





Roberts 





[g， tl = edgef(f， Sobe1l:，T，dir) ， 
Tewitt Edge Detector 


WhereT is the input image,Tis aspecified threshold,and dir specifies the Pre 和 Prewitt edge detector uses the masks in Fig. 10.S(c) to approximate digi- 
[g， tl] = edge(fT， prewitt ，T，dir) 


TewWitt detector is slightly simpler to implement computationally than the 
obel detector, but it tends to produce somewhat noisier results. (It can be 
Shown that the coefficient with value 2 in the Sobel detector Provides 


Smoothing,) 
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FIGURE 10.5 
Some edge 
detector masks 
and the first-order 
derivatives they 
Implement. 
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Roberts Edge Detector 


Jhe Roberts edge detector uses the masks in Fig. 10.S(d) to appIOximate dig 
taly the first derivatives Cn and COy. Its general calling syntax is 


ny Edge Detector 


Canny detector (Canny [1986]) is the most Powerftul edge detector Pro- 
by function edge. TIhe method can be SumImarlzed as follows: 


The image is Smoothed using a Gaussian filter with a Speclfied Standard 
jeviation,o, to reduce noise. 

The jlocal gradient，8g(2 尹 = [G2 + G?] 7 and edge direction， 
(2 y) 三 tan (GV/G。)， are computed at each point. Any of the first 
three techniques in Jabjle 10.1 can be used to compute Crand Cy.Anedge 
oint is defined to be a point whose strength is locally mmaximum in the di- 


ection of the gradient. 

The edge points determined in (2) give rise to ridges in the gradient mag- 
itude image, The algorithm then tracks along the top of these rildges and 
ets to Zero all Pixels that are not actuajlly on the ridge top So as to give 3 


[1g，Ttl = edge(f， "roberts' ，T，dir) 


The parameters of this function are identical to the Sobel parameters : 
民 oberts detector is one of the oldest edge detectors in digital image proces 
ing, and as Fig. 10.5(d) shows, it is also the Simplest.This detector is used co 
Siderably less than the others in Fig. 10.$ due in part to its limited functionali 
(e.g, it 8 not symmetric and cannot be generalized to detect edges that 让 
mujtiples of 45 ). However'it still is used frequentiy in hardware implemen 
tons where Simplicity and speed are dominant factors. 





Laplacian of a Gaussian (LoG) Detector 


Conslder the Gaussian function 罗 in line in the output, a Process known as POPHIQXIMIQL SUDDression. The 
HRr) = - 252 idge pixels are then thresholded using two thresholds, 77 and 72，with 

了 < 72.Ridge Pixels with values greater than 72 are Sald to be Strong 

where 六 = 六 十 妈 andeisthestandard deviation.Thisis a Smoothing func edge pixels. Ridge Pixels with values between Tand7y2 are sald to be 


”Weak "edge PIxel$、 
Finally, the algorithm Performs edge jinking by incorporating the Weak 
-pixejls that are 8-connected to the strong PIXels. 


tion whlich, 过 convolved with an image, will blurit.The degree ofblurring is de 
termined by the value of co. The Laplaclan of this function (the secon 
derivative with respect to 六 is 
Vz2p(r) = - he syntax for the Canny edge detector is 
[gt] = edge(f， "canny ，T，Sigmal) 
For obvious reasons, this function is called the Laplacian of a Gaussian (Lo 
Because the second derivative is a linear operation, convolving (filtering) : 
image with V“Apn(r) is the same as convolving the Image with the smooth 
function first and then computing the Laplacian of the result. This is the ke 
concept underlying the LoG detector. We conyvolve the image with V27 
knowing that it has two effects: It Smoothes the image (thus reducing noise 
and it computes the Laplacian, which yields a double-edge image, Locatin 
edges then consists of finding the zero crossings between the double edges. 
The general cajling syntax for the LoG detector is 


here Tis a vector,T= [T1,，T2],containing the two thresholds explained In 
3 of the preceding procedure, and sigma is the standard deviation of the 
othing filter. 于 tis included in the output argument, it ls a two-element 
ctor containing the two threshold values used by the algorithm, The rest of 
the syntax is as explained for the other methods,inciuding the automatic com- 
piitation of thresholds iT is not supplied.The defaujt value for sigma ls 1 


AN 


; We can extract and display the vertical edges in the image, f, of Fig. 10.6(a) 


[g， tj = edge(f， 10g ，T，Ssigma) AsSIng the commands 


where sigma is the standard deviation and the other parameters are 3S 6X 
Plained previously. TIhe default value for sigma is 2. As before, edge ignore 
any edges that are not stronger than T.IfTis not provided,or it is empty,[ 
edge chooses the value automatically Setting T to 0 produces edges that 3 
closed contours, a familiar characteristic of the LoG method， 


 [gv，t]l = edge(f，'sobel'，'Vvertical ' ) | 
”imshow(gv) 


0.0516 
Zero-Lrossings Detector 


Thils detector is based on the Same concept as the LoG method, but the conyo 
lution ls carried out using a specified filter function, H.The cailling syntax is 


As Fig. 10.6(b) shows, the predominant edges in the result are vertical (the 
inclined edges have vertical and horizontal components, so they are detect- 
ed as well). We can clean Up the weaker edges Somewhat by Specifying a 
higher threshold value. For example, Fig. 10.6(c) was generated usling the 


Jhe other parameters are as explained for the LoG detector， comnmand 


[g，t]j = edge(f， "zerocross' ， 工 ，H) 
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也 XAMIPL 了 10.3: 
尼 dge extraction 
with the Sobel 
detector. 
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Cd 

FICURE 10.6 

(a) Original 
image. (bj) Result 
of functton edge 
Using a vertical 
Sobel mask with 
the threshold 
determined 
automatically. 

(c) Result using 3 
specifled 
threshold. 

(d) Result of 
determining both 
vertical and 
horizontal edges 
with a Specifled 
threshold. 

(e) Result of 
computing edges 
at 49” with 
imfilter Using 3 
speclified mask 
and a specified 
threshold. (全 
Result of 
computing edges 
at 一 45”with 
imfilter Using a 
specified mask 
and a Specified 


threshold. 
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Sobel1' ，0.15， vert1lIcal ) ; 


2 


上 2] 


0 
0 1 
2 
sq45 = imfilter(doubleff)， Feplicate ) 
= 二 0.3*max(absf(g45(:))); 
dg45 = g45 >= T) 
Igure，imshow(g45) 


W45 ， 


Strongest edge in Fig. 10.6(e) is the edge oriented at 43?". Similarly， 


sing the 'prewitt' and 'roberts' options in function edge follows the 
me general procedure just ilustrated for the Sobel edge detector. 南 


ittile roof The principal edges of interest in this discussion are the building 
Fners, the windows, the light-brick structure framing the entrance, the en- 


krance itself the roofline, and the concrete band surrounding the building 


Sobel1 ) ;和 Fig， 10.7(al) 
工 0g ) 和 Fig，10.7(C) 
Canny '); 各 Fig， 10.7(e) 


[g_sobel default，ts] edge (了 ， 
有 [g_ 1og defaulit，tlog]j = edgef(f， 
ke [g_ Canny_ default，tc] edge (ff ， 





le values of the thresholds in the output argument resulting from the pre- 
ceding compnutations were ts = 0.074, tl1og = 0.0025, and tc = [0.019， 
047].The defaults values of sigma forthe '1og' and 'canny' options were 
0 and 1.0,respectively With the exception of the Sobel image, the default re- 
sults were far from the objective of producing clean edge maps. 
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7TPe vatuie orT was 
CPoser EXPeFrInteH- 
ialiy io SPom resttit 
cOP2zDarapie mit 
Fi8S. 7O(c) ard 
7O(d) 


卫 XAMPLE 10.4; 
Comparlson of 
the Sobel, LoG 
and Canny edge 
detectors. 
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tarting with the default values, the ParameteTISs in each option were varied 
factively with the objective of bringing out the Principal features men- 
dearlier while reducing irrelevant detail as much as possible. The results 
e right column of Fig. 10.7 were obtained with the following commands: 


a-F 
cd 
人 


FIGURE 10.7 Left 


column: Defaujt 
resujlts for the sobel best = edge(f，'Ssobel ，0.05) % Fig，10.7(b) 


Sobel LoG,and 有 有 E 二 0 10g best = edge(f，'10g'，0.003，2.25) % Fig，10.7(d) 
Canny edge 0 人 canny_best = edge(fT，'canny ，[0.04 0.10]， 1.5); 5% Fig， 10.7(Tf) 
detectors. Right 一 本 

columan: Results ig. 10.7(b) shows, the Sobel result actuajly deviated even further from the 


ob a ine a PT ctive when we tried to bring out the concrete band and left edge of the en- 
teractivet 半生 上 e way. The LoG result in Fig. 10.7(d) is somewhat better than the sobel 
0 ilt and much better than the LoG default but it stil could not bring out the 


PrIincipajl features 
in the original dge of the main entrance nor the concrete band around the building. Ehe 


| ) wbil y result [Fig. 10.7(D] is superior by far to the other two results. Note in 
0 ficular how the left edge of the entrance Was clearly detected, as were tbe 
iete band and other details such as the complete roof ventilation grill 


iTrrelevant, fine 
detall. The Canny 
edge detector 
produced the best 
TeSujts by far. 


e the main entrance. In addition to detecting the desired features, the 
ny detector also produced the cleanest edge map. 山 


现 Line Detection Using the Hough Iransform 


lly the methods discussed in the previous sectlon should yield Pixels 
g only on edges. In practice, the Tesulting pixels seldom characterlze an 
ge Completely because of noise， breaks in the edge fom nonuniform 11ju- 
ination, and other effects that introduce spurious intensity discontinuitles， 
Phus edge-detection algorithms typically are followed by linking Procedures 
asseImnbje edge Pixels into meaningful edges, One approach that can be 
led to find and link limne Segments in an image is the 五 orgP trarsjorma 
Hough [1962])， 

Given aset of points in an image (typically a pinary image),Suppose that we 
ant to find subsets of these points that lie on Straight lines. One posslble SO- 
ion is to first find all lines determined by every pair of points and then find 
中 subsets of points that are close to particular lines. The probiem with thls 
Tocedure is that it involves finding mn(2 一 1)/2 ~ 1 jines and then perform- 
ng PnP 一 1))/2 ~ 到 comparisons of every point to all jines， This approach 
ompnutationally prohibitive in all but the most trivlal applications. 

With the Hough transform, on the other hand, we consider a Point (2 ， 力 ) 
nd all the lines that pass through lt， Infinitely many lines Pass through (xi， 力 )， 
of which satisfy the slope-intercept equation 其 二 03 十 pb for some vajlues 
fa and b. Writing this equation as 品 三 一 Xi0 十 其 and considering the apD- 
lane (also cajled Parameter Space) yields the equation of a sin8gle jne for a 
Xed Pair (Xi 六 ). Furthermore, 3 second point (xj, 攻 ) also has a line in para- 
meter Space associated with it, and this line Intersects the line associated with 
， 力 ) at (Ga pwhere 4 is the slope and 六 the Intercept of the jne con- 
ning both (xy) and (zh 邦 ) 记 the xy-plane. im fact， all points contained on 
his line have lines in parameter space that intersect at (0 ，,D ), Figure 10.8 让 
Ustrates these concepts， 
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he computational attractiveness of the Hough transform arlses from Sub- 





a 卫 
FICURE 10.8 ding the p0 Parameter Space lnto So-called accuraiator celis, as Ulustrated 
(a) xy-pPlane. ture 10.9(c), where (pmin, pmax) and (guin,gnax) are the expected ranges 
(b) Parameter the parameter values，Usually，the maximum range of values is 
AR 全 攻 002 二 0 三 90"and 一 已 三 p 三 DwhereZDisthedistance between corners 






ie image. The cell at coordinates (人 疙 , with accumulator value 4 人 门 , cor- 
sponds to the square associated with Parameter Space coordinates〈pPi， 6 门 ， 
]ly, these cejljs are set to zero Ihen, for every nonbackground 省 
欠 ) iD the image plane, we let 9 equal each of the allowed subdivision val- 
on the 9 axls and solve for the corresponding p using the equation 
Xrcos0 二 以 sin6.The resulting p-values are then rounded off to the 
mearest allowed cell value along the p-axis. Ihe corresponding accumuljator 
is then incremented.Atthe end of this procedure, avalue of Cin 4 人 让， 
ns that C points in the xy-plane lie on the line xcosb + ysSingbi = 
In Principle, the parameter-space jnes Corresponding to all image poin he number of subdivisions in the pb-plane determines the accuracy of the co- 


(cp y) 









large numbers of parameter-space lines intersect. A _ Practical difficulty 
thils apPproach， however, 1S that G (the of the jine) approaches infinit 


use the normal representation of a line: 








xcosbg+ysinbg=p it to Sparse matrix format by using function Sparse，which has the 
Sic SyntaxX 

0. A horizontal lne has 0 = 0", with p being equal to the bositive x- -intercegt 
Similarly, a vertical line has 9 = 90"，with p being equal to the Positive 和 
intercept, of g = 一 90?, with p 0 equal to the negative y intercept. 0 


S = Sparse(A) 









=[0 0 0 5 
荔 ) 0 2 0 0 
人 
0 0 4 01]; 
b 1 Da 
人 = Sparse(A) 
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FIGURE 10.9 (a)〈p,9) Parameterization of lines in the xy-plane. (b) Sinusoidal curves in the p6-plane; thbe 
point of intersection，(p ,6 )，corresponds to the parameters of the line joining (xy) and (2 基 


(9) Division of the p0- -Plane Into accumuiator cells. s -= sparse(fr，c，s，m，n) 
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2 
0 
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hough 


人 











sqrt((W 一 人)“2 + (N 一 172) 
ceiLl(D/drho) ; 

人 
1Linspace(-q*drho，q*drho，mnrho) ; 
val] = find(f) ; 

X 一 1 yY=y 一 1 

让 Ialize Output ， 
zeros(nrho，1length(theta) ) ; 


Here,rand c are vectors of row and column indices,respectively of then 
IO elements of the matrix we Wish to convert to Sparse format. Paramete S 
a vector containing the values that correspond to the index pairs (F ， cj),aia 
andn are the Tow and column dimensions for the resulting matrix.F 

Stance, the matrix S in the previous example can be generated directly W 
the command 


>> 3S = Sparse(I[3 2 34 1]，[12234]，[1234 5]，4，4) 






















Jhere are a number of other syntax forms for function sparse, as detailed 
the help page for this function. 

Glven a Sparse matrix S generated by any of its applicable syntax form 
can obtalin the full matrix back by using function ful1, whose syntax is 


alues at a tinme. 
kk = 1:ceil(Length(val)/11000) 


matrILX = repmat(x(first:1LIast)，1，ntheta) 

_matrix = repmat{(y(first:1ast)，1，nthetal) 

al matrix = repmat (vaL(first:1ast)，1，ntheta) ; 

theta matrix = Fepmat(theta，Size(x_matrix，1)，1)*pI/7180; 


rho _ matrix = X matrix,*Cos(theta_matrIXx) + ,'， 

yY matrix.xsin(theta_matrix) j 

pe = (nrho - 1)/(rho(lend) 一 PholTt)) 

_bin index = round(sSlope*(rho_matrix 一 rho(1)) + 1)3; 


a_bin_index = frepmat(1:ntheta， Size(X_matrix，1)， 十 ) 


ake advantage of the fact that the SPARSE function，which 
onstructs 8 sparse matrix，accumulates values When ILnput 
ndices are repeated，That's the behavior we want for the 
Hough transform， We want the Output to be a fulL1L (nonsparse ) 
atrix，however，sS0 we Call function FULL on the OutpPUt of 
PARSE , 站 
h + full(sparse(rho_bin_index(:)，theta_bin_index(:)，.…: 
val_matrix(:)，nrho，ntheta) ) ) 


A = fu1L1(S) 


Jo explore Hough transform-based line detection in MATLAB, we 
write a function, hough .m, that compnutes the Hough transform: 


function [fh，theta，rho]j = hough(f，dtheta，drho) 

%HOUGH Hough transfornm， 

[H，TIHETA，RH0O] = HOUGH(F，DTHETA，DRH0O) computes the Hough 
transform of the :image F，DTHETA specifies the spacing (in 
degrees) of the Hough transform bins along the theta axis。 DRHO 
Specifies the Spacing of the Hough transform bins along the rho 
axiSs. H is the Hough transform matrix，It is NRHO-by-NTHETA， 
where NRHO = 2*ceil(norm{size(F))/DRHO) -- 1，and NTHETA = 

2 “Cel1(90/DTHETA) ，Note that if 90/DTHETA is not an integer，th 
actual angle Spacing will be 90 / ceil1(90/7DTHETA) . 


THETA is an NTHETA-element vector containing the angle (in 
degrees) Corresponding to each column of H，RHO is an 

NRHO-element Vector Containing the value of rho _ corresponding to- 
each Fow of H， 


本 
den 


JIn this example we ilustrate the use of function hough on a Simple blinary 


[HH，THETA，RHO] = HOUGH(F) computes the Hough transform using iage. First we construct an image containing isolated foreground Pixels 1n 
DTHETA = 1 and DRHO = 1， :SeVeIal locatlons. 


ReoReoeeceasedeocee do do oo do Oo 


ee 1 zeros(101，101) 

end fl 1) =1T fl10， 1) = 1 f(1，101) = 1; 

if nargin < 2 f(101，101) = 1 Tf(51，51) = 1 

攻 人 gure 10.10(a) shows our test image.Next we compute and display the Hough 


f = doubjlie(f) ; 

[M,N] = Size( 和 ) ; 

theta = Jinspace(-90，0，ceil(90/dtheta) + 1T); 
theta = [theta -flipLr(theta(2:end -1))]; 
ntheta = 1ength(thetay) ; 


= hough(f ) ; 
>> imshow(H，[ ]) 


人 10.10(b) shows the result, displayed with imshow In the familiar Way 
However, it often is more useful to visualize Hough transforms in a larger Plot， 
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也 叉 AMPLEE 10.3; 
Illustration of the 
Hough transform. 
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慌 labeled axes. In the next code fragment we call hough with three output 
pimnentS; the second two output arguments contain the 0 and p values corre- 
nding to each column and row， Tespectively, of the Hough transform ma- 
Jhbese Vectors theta and rho, can then be passed as additional input 
ents to imshow to control the horizontal and vertical axis labeling. We 
ass the “notruesize' option to imshow. The axis function is used to 


人 


FICURE 10.10 
(a) Binary imapge 
with 在 ve dots 
(four of the dots 


， n axls jabeling and to Imake the display fill the rectangular shape of the 
(b) Hough e.Finajly the xlabel and ylabel functions (see Section 3.3.1) are used to 
transform el the axes Using a LaTeX-style notation for Greek letters. 

displayed using :9 

ImShow- ，theta，rho] = hough(f); 


(c) Alternative 
Hough transform 
display with axls 
jabeling. (The 
dots in (a) were 
enjarged to make 
thbem easier to 
See,) 


mshow(ttheta，rho，H，[ ]， "notruesize') 
XiSs 0n，axlis noprmal 
1abel( \theta )，ylabel('\rho' ) 





e 10.10(c) shows the labeled result. The intersections of three sinusoidal 
S at 士 42 indicate that there are twWo sets of three collinear points in f. 
He intersectlons of two sinusoidal curves at (0， pi 三 (一 90,0),(-90, -100)， 
0), and (0, 100) indicate that there are four sets of collinear points that lie 
ong vertical and horizontal lines. 


2.] Hough Transform Peak Detection 


le first step in using the Hough transform for line detection and linking 1S 

eak detection. Finding a meaningful set of distinct peaks in a Hough trans- 

im can be challenging, Because of the quantization in space of the digital 
ge,the quantization in parameter Space of the Hough transform, as well as 

fact that edges in typical images are not perftectly straight, Hough trans- 

Im Peaks tend to lie in more than one Hough transform cell One Strategy to 
ercome thls problem is the following: 


一 1I00 


Find the Hough transform cell containing tbhe highest value and record its 
location， 

9UPPress (set to zero) Hough transform cells in the immediate Deighbor- 
hood of the maximun found in step 1. 

Repeat until the desired number of peaks has been found, or until a SPpec- 


ified threshold has been reached. 


nction houghpeaks implements this strategy 


Unction [F，c，hnew] = houghpeaks(h，numpeaks，threshold，nhood) 
IOUGHPEAKS Detect peaks in Hough transform， 

[R，C，HNEW]】 = HOUGHPEAKS(H，NUMPEAKS ， THRESHOLD，NHOOD) detects 
peaks in the Hough transform matrix H，NUMPEAKS S$Speclifies the 
maximum number of peak locations to look for. Values of 有 below 
THRESHOLD will not be considered to be peaks，. NHO0OD is a 
two-element vector specifying the size of the SuppressSion 
nelghborhood. This is the neighborhood around each peak that 1IS 


100 
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houghpeaks 
疏 人 
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Set to zero after the peak is identified. The 6lementS of NHO0 
must be positive，odd integers. R and C are the row and Column 
coordinates of the identified peaks，HNEW is the Hough transfon 
with peak neighborhood suppressed， 


已 
心 
马 
《D 

山 


length(r) == numpeakS ; 


done = true 


If NHO0OD is omitted，it defaults to the smallest 0dd Values >= 
size(H)/50. If THRESHOLD is omitted，it defaujlts to 
0.5x*max(H(:))，If NUMPEAKS is omitted，it defaults to 1 ， 


if nargin < 4 
nhood = Size(h)y750 
% Make sure the neighborhood Size is 0dd ， 
nhood = max(2*ceil(nhood/2) + T，1) 
end 
If nargin < 3 
threshold = 0.5 * max(h(:)) 


Oo oo ooeooe go op 避 


unction houghpeaks 1 illustrated in Example 10.6. 


,2 Hough Transform Line Detection and Linking 


ce aset of candidate peaks has been jidentified in the Hough transform 1t 
mains to be determined if there are line segments asSoOciated with those 
aks, as well as Where they start and end. For each peak, the frst step is to 
d the location of all nonzero Pixels in the Image that contributed to that 
ak. For this Purpose, we write function houghpixe1S， 


end 

if nargin < 2 iction [FF，C] = houghpixels(f，theta， rho， rbin，cbin) houghpixelS 
numpeaks = 1 jl0OUGHPIXELS Compute :image pixels belonging to Hough transform bln， ON 

end [R，C] = HOUGHPIXELS(F，THETA，RHO，RBIN，CBIN) Computes the 


row-colLumn indices (R，C) for nonzero pixels in image F that map 
to a particular Hough transform bin，(RBIN，CBIN) RBIN and CBIN 
are scalars indicating the row-column bin 1ocation in the Hough 


done = falsei 
hnew = hjr= [];c= []; 





While ~done transform matrix returned by function HOUGH. THETA and RHQ are 
[p, q] = find(hnew == max(hnew(:) ) ) the second and third output arguments from the HOUGH function， 
p=p1igq=9(1) _ 上 f; 
if hnew(p，q) >= threshold ee 

(end + 1) = pj cl(end + 1) = 9 


eta C = theta(cbin) * pi / 180; 


% Suppress this maximum and itSs 5Cc10Se neighbors， 0 xy = xxcos(theta c) + yx*sSin(theta_c) 


p1=p=- (nnood(1) 一 1)1/2; pP2=pr+(nhood(1) -1)72; ho = length(rho) ; 

q1=gq- (nhood(2) 一 1)/2; 92=9+ (nhood(2) 一 们 /2 ope = (nrho -1)/(rho(end) - rho(t1)); 

[pp，9qq] = ndgrid(p1:p2，913:9q2) 1 0 bin index = round(SLope*(rho_xy 一 rho(1)) + 1T) 1; 
pp = pp(:); qq = qq() 网 


xX = find(rho_bin_index == rbin) ; 
s% Throw away neighbor coordinates that are Out of bounds in 
% the rho direction， 

badrho = find((pp < 1) | (pp > Size(h，1))); 

pp(badrho) = []; 9qq(badrho) = []; 


xf(idx) + 1; CC = y(idx) + 1; 2 


pixels_ associated with the locations found Using houghpixels IUSt be 
rouped into line segments. Function houghlines USes the following strategy: 

% For coordinates that are out of bounds in the theta 

% direction，we want to consider that H isS antiSymmetrIicC 
% along the rho axis for theta = +/- 90 degrees， 

theta too low = find(qqd < 1)) 

qq(theta_too_low) = size(h，2) + qq(theta_ too_LOow) 
pp(theta too_ low) = Size(h，1) ~ pp(theta too_JLowW) + 1 
theta too_highn = find(qq > Size(h，2) 1) 
qq(theta_too_high) = qq(theta_too_high) - Size(h，2) 
pp(theta_too_high) = Size(h，1) ~- pp(theta_too_high) + 机 


，Rotate the Pixel locations by 90” - 9 so that they he apProximately along 

avertical line， 

. Sort the pixel locations by their rotated x-values. 

.Use function diff to locate gaps, Ignore small gaps; this has the ettect of 
merging adjacent tine Segments that are separated by a smajl space. 

. Return information about line segments that are longer than some mlnl- 

_mum length threshojld. 


虽 


function lines = houghLlines(f,thetarhorryccyfil1gap,minlength) houghlines 
0UGHLINES Extract Line segments based on the Hough transform 2 
LINES = HOUGHLINES(F，THETA，RHO，RR，CC，FILLGAP，MINLENGTH) 


% Convert to linear indices to zero out al1 the values ， 
hnew(sub2ind(Size(hnew)，pp，dqq)) = 0; 
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of the Hough transform bins to USse in Searching for 1Line 


the Same Hough transform bin that are Separated by 1ess than 
FILLGAP pixels，HOUGHLINES merges them into a Single 1Line 
Segment ，FILLGAP defaujlts to 20 if omitted。 Merged Line 
segmentSs 1eSss than MINLENGTH Pixels 10ng are discarded 
MINLENGTH defaujts to 40 if omitted , 


LINES ls a_ Structure array Whose length equals the number of 


has these fields: 


oOeseoceoeotpspaoecdedeue op do do oo co es co 


point1 End-point of the line Segment， two-element vector 
point2 “ End-point of the 1Line Segment;， two-element vector 
Tength “ Distance between point1t and point2 

theta Angle (in degrees) of the Hough transform bin 

rho Rho-axiSs position of the Hough transform bin 


if nargin < 6 
fillJgap = 20; 

end 

if nargin < 7 
minlength = 40; 

end 


numlines = 0; lines = StPuct ; 
for kK = 1:JIengthfrr) 
rbin = rr(k); cbin = ccfk); 


#%5 Get all pixels associated with Hough transform ce11， 
[Fr，C] = houghpixeJls(f，theta，rho，rbin，cbin) ; 
if IISempty(r) 
Continue 
end 


当 Rotate the pixel 1Locations about (1,1) so that they 1ie 
各 &pproximately along a vertical 1Line， 

omega = (90 - thetal(cbin)) * pi / 180; 

T = [cos(omega) Sin(omega); -Sin(omega) cos(omegal) ] ; 

xy= [FF 一 1c 一 1 *T; 

X = SOrt(XY(: ,1)); 


委 Find the gaps larger than the threshold. 
diff xX= [diff(x); Infji 
idx = [0; find(diff x > fil1gap)]; 
for pP= 1:1ength(idx) 一 1 

X1 x(idx(p) + 1); x2 = x(idx(p + 1)); 

Jinelength = X2 一 X1; 

if Jinelength >= minlength 

point1 = [x1 rho(rbin)]; point2 = [fx2 rho(rbin)]; 






extractSs Jine Segments in the image F associated with particular 
bins in a Hough transform. THETA and RHO are vectors returned by 
function HOUGH， Vectors RR and CC Specify the Fows and columns 


SegmentS. If HOUGHLINES finds two Jine SegmentS asSSociated With 


nerged line Segments found. Each element of the Strudcture array-: 


5 Rotate the end-point 
和 angle， 

Tinv = inv(T); 

point1i = point1 * TinV; 


Locations back to the original 


point2 = point2 * Tinv; 





num]ines = numlines + 1 ; 





Lines(numlines).point1 = point1 + 1 
lines(numlineS).point2 = point2 + 1 
lines(numlines).1length = 1inelength; 


Tines(numlines) .theta = theta(cbin) ; 
Jines(numlines).rho = rhol(rbin) ; 
end 


nd 本 

















In thls example we use functions hough, houghpeaks, and houghlines to 
nd a set of jine segments in the binary image, f,in Fig. 10.7( 人 .First, we com- 
pute and display the Hough transform, using a finer angular spacing thban the 
default (A6 = 0.3 instead of 1.0). 


>> [H，theta，rho] = hough{(f，0.5) ; 
>> imshow(theta，rho，H， 上 ]， notruesize')，axis on，axis normal 
>> Xlabel( \theta')，ylabel('\rho') 


Next we use function houghpeaks to find five Hough transform peaks that are 
]ikely to be significant. 


>> [Fr，c] = houghpeaks(H，5) ; 

>> hold on 

>> plot(theta(c)，frho(r)，'Linestyle' ， 
marker' ，"'S ，' color' ，'W') 


none'，..， 


Figure 10.11(aj shows the Hough transform with the peak locations Superim- 
posed. Finally we use function houghlines to find and link line Segmients and 
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B = Inv(A) com1- 
Piles jpne inverse OF 
SGFe TIGIFLY A， 


上 么 AMPLE 10.6: 
Usinsg the Hough 
transform for jine 
detection and 
linking. 


ab 


FICURE 10.11 

(a) 再 ough 
transform with 
five peak 
locations selected. 
(b) Line segments 
Corresponding to 
the Housgh 
transtorm Peaks. 
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ethods for choosing a gobal threshold are discussed in Section 10.3.1. 种 


we SuUperimpose the line segments on the orlginal binary image using imsh 
stion 10.3.2 we discuss allowing the threshold to vary which is called /Jocal 


hold on, and plLot: 


>> LInes = houghlines(f，theta，rho，F，C) 

>> figure，imshow(f)，hold on 

>> for kK = 1:Length(1ines) 

xy = [1ines(k),.point1f+ ; Lines(k).point2] ; 

pJlot{(xy(; ,2)，xy(: 1)，'LineWidth ，4， Color ，[.6 ,6 .6])， 
end 


,1 Global Thresholding 


e Way to choose a threshold is by visual inspection of the image histograrmm， 
e histogram in Figure 10.12 clearly has two distinct modes; as a Iesujt, it 18 
vto choose athreshold Tthat separates them.Another method of chooslng 
sby trial and error, picking different thresholds until one is found that pro- 
ces a good result as judged by the observer. This is particularly eftective In 
和 interactive environment, Such as one that ajlows the user to change the 
shold using a widget (graphical controb such as a slider and see tbe result 
ediately. 

For choosing a threshold automaticajly， Gonzalez and Woods [2002] de- 
ibe the following iterative PITocedure: 


Figure 10.11(b) shows the resulting image with the detected segments sup 
Imbosed as thick, gray lines. 





Mk Thresholdqing 


Because of jlts intuitive Properties and Simplicity of Implementation，imaga 
tbresholding enjoys a central position in applications of image segmentati 
Simple thresholding was first introduced in Section 2.7.2,and we have used 让 
various discusslons In the preceding chapters. In this section, we discuss ways0 
choosing the threshold value automaticajly and we consider a method for val 
ing the threshold according to the properties of local image neighborhoods， 

Suppose that the intensity histogram shown in Fig. 10.12 corresponds to 
image, 帮 ULxz, y), composed of light objects on a dark background,in such a 
that object and background Pixels have intensity levejls grouped Into two do 
inant modes. One obvious way to extract the objects from the background is to 
Select a threshold 7 that Separates these modes. Then any point (zx, y) 扣 
wpich AIx,y) 三 Tiscalled an opject poinit otherwise, the point is called 3 
packgrouaa poink In other words, the thresholded image 8g8(xz, y) is defined 


-Select an initial estimate for 7. (A suggested initial estimate is the mld- 
point between the minimum and maximum intensity values in the Image.) 
-Segment the image Using 了. This will Produce two groups of pixels: Cn ， 
“consisting of all pixels with intensity values 三 7 ,and C2, consisting of PIX- 
“els with values < 了. 

Compute the average intensity values HA1 and Ha for the Pixels in reglonas 
-Gand CC . 

_Compute anew threshold value: 





1 
《7 = 了 (Ha 十 A2) 


Repeat steps 2 through 4 until the difference in 了 in SuccesSive lterTations 
1 这 帮 x, 功 兰 工 .smaller than a predefined Parameter 7o. 


0 这 F(xz) < 了 


Pixels labeled 1 correspond to objects, whereas Pixels labeljed 0 correspond to the 
background. When 7 is a constant, this approach ls called gliopa/ 护 respoldig 


8 )) 二 
We show how to implement this procedure in MATLAB in Example 10.7/. 





ld using Otsu's method (Otsu [1979]). To examine the formujlation of thls 
istogram-based method, we start by treating the normalized histogram as 3 


iscrete probability density function, as in 
FICURE 10.12 


Selecting 3 1r 

threshold by 斋 人 站 二 这 证 交 二 计 

vlsually analyzlng 

ablmodal here mis the total number of pixels in the image, mv is the number of Pix- 
pistogTram. 


ls that have intensity level m, and 人 is the total number of possible inten- 
ity levels in the image,. Now suppose that a threshold Kis chosen Such tbat 
0is the set of pixels with levels [0,1,.. ,大 一 1]andCnis the set of plXejls 
ith levels [K 大 二 1, 工 一 划 Otsus method chooses the threshold 
alue K that maximizes the petween-class variance a3，wWhich is defined as 





0 = oo(pho 一 MT7 + ol(A 一 Ar 
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-二 gpayt hresh 


了 上 和 AMPELE 10.7: 
Computing global 
thresholds. 


a b 


FIGURE 10.13 

{(a) Scanned text. 
(b) Thresholded 
text obtalined 
usling function 
graythresh. 


























ith Tequal to 101.47. 
Next we compnute a threshold using function graythresh: 


where 


天 一 ] 
(00 一 2 Pro( 广 ) 
如 一 () 
天 二 
1 三 0.3961 
2 399 


H0 三 王 omt )/ oo 5 


是 


T2 = graythreshfTf) 


上 芭 本 间 dpo(1zjM/wl iresholding using these two values produces images that are almostindistinguish- 
le 人 fom each other. Figure 10.13(b) shows the image thresholded using T2 果 
全 肖 记 2Pot7) 3.2 Local Thresholding 


the threshold value that maximjzes cg. The threshold is returned as a normial 
1ized value between 0.0 and 1.0.The calling syntax for graythresh is 


iproved thresholding result shown in Rig. 9.26(e) was computed by applying 


graythresh (ff) 5 
morphological top-hat operator and then using graythresh on the resnjt. 


where f is the input image and T is the resulting threshold. To segment the 
Image we use T in function im2bw introduced in Section 2.7.2. Because b 
threshold is normalized to the range [0， ]], it must be scaled to the Proper 
Iange before it is used, For example,iffis ofclass uint8, we multiplyT by255 
before Using it. 


有 让 FUzy) 关 TOxzy) 
0 


滤 In this example we illustrate the iterative procedure described Previoaal 
as Well as Otsu's method on the gray-scale image, f, of scanned text, shown 也 
Fig. 10.13(a). The iterative method can be implemented as follows: 


TD 功 =jot7)+T 


lihe Image (zy) is the morphological opening of 太 and the constant Tis 
下 he result of function graythresh applied to 六 . 


>> T=0.5*(double(min(f(:))) + double(max(f(:)))); 光 
>> done = falLSe; 
>> While -done 
是 
Tnext = 0.5*(meanff(g)) + 
done = abs(T 一 Tnext) < 0.5; 
T = Tnext+t; 
end 


Reglon-Based Segmentation 


e objective of segmentation is to partition an image into regions，In 
ectlons 10.1 and 10.2 we approached this problem by finding boundaries be- 
Ween Teglons based on discontinuities in intensity levels，whereas in 
ection 10.3 segmentation was accomplished via thresholds based on the dis- 
ibution of pixel Properties, such as intensity values. In this section we discuss 


For this particular image, the while loop executes four times and terminates egmentation techniques that are based on finding the regions directly， 


4 Basic 了 ormulation 


ponents or broken connection paths. There is mg pOI ， ， . 
et 信 TePIesSent the entire lmmage Tegion， We may YIew Segmentation 3s 3 


AAA 人 A 生 DPIOCeSS that partitions 及 into 7 subregions, Ri , R ,Rsuch that 
processirtg. Segnmientation accuracy determines the 
of computeTized anatiysis Procedures. For this reason 用 
be taken to improve the Probability of rugged segme 人 (8) LUR， 一 尺 : 
such ds mdustrial inspection appjications, at least soOm 若 R 
the environment IN Possihle :mtCs ERhe EXPETICnCL 和 (人 ) 及 ， jsa connected region， 1 一 1， 光 用. 


desipner invialriahly pilys considerable attention to Suc 
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lize the concept of size, likeness between a candidate Pixel and the PIxels 
Own So far (Such as a comparison of the intensity of a candidate and the av- 
ipge intensity of the grown reglon),and the shape of the reglon beling gTOWDn， 
euse of these types of descriptors is based on the assumption that a model 
egpected results ls at jeast partially available. 

To iustrate the principljes of how reglon Segmentation can be handied im 
ATLAB， we develop next an M-function, called regiongrow,to do baslic re- 
n growWing. The syntax for this function is 


(c) 居 们 玉 i 三 包 ftoralliand 六 关 小 
(中 PRN) =TRUEfori = 1 工 2,...，7 
ja the comlextofie  (e) PRIURI) = FEALSEforany adjacent repgions Ri and 人 


CCzSSiO7 ET ecrioi 
94 pvo disjointre- Here, PRAi) is alogical predicate defined over the points in set Rand gj is: 
SIO15 iand Rare nullset 


Said 1 be adjace 有 ti 洲 
peirinion 加 pzga Condition (a) indicates that the segmentation must be complete' tb 


cotecied eyeIy Pixel must be in a region,Jhe second condition requires that Points i 
OA Teglon be connected in some Predefined SenSse (e.g.,4- or 8-connected)， Condi 
tion (cj indicates that the regions mnust be disjoint. Condition (d) deals wifi 
the Properties that must be satisfied by the pixels in a Segmented region 一 如 
exaImnple PR) = TRUE 让 all pixels in 尺 have the same gray level. Fina 并 
condition (e) indicates that adjacent Tegions 尺 and Ai are ditferent in 记 
Sense of predicate 己 . 


[g，NR，SI，TI] = regiongrow(f，S，T) 


here f js an imnage to be segmented and Parameter S can be an array (the 
me Size as f) or a Scalar, 于 Sis an array,it must contain 1s at all the coordi- 
ates wbere seed points are ljocated and Us elsewhere. 9uch an arTay can be de- 
ermined by inspection, or by an external seed-finding function.IfSis ascalar， 
efines an intensity value such that all the points in f with that value become 
dpoints. Similarly, T can be an array (the Same Size as ff) OoT a SCcalar, IT 1S 
array,it contains a threshold value for each location in f. 芝 Tils scalar,lt de- 
nes a global threshold.Jhe thresphold value(s) is (are) used to test 二 a PIXel in 
ithe imapge is sufficiently similar to the seed or seeds to which it 18 8-connected. 
For example, ES=aandT=b,andweare comparng intensities, then 3 
IXel is said to be similar to a (in the sense of Passing the threshold test) 过 the 
bsolute value of the difference between its intensity and a is less than or 
sequal to b. If in addition, the Pixel in question 1S 8-connected to one oOT IOre 
d values, then the pixel js considered a member of one Or IOoIe Tegions. 
jlar comments hold 让 S andTare arrays, the basic difference being that 
Parisons are done with the appTopriate locations defined in S and corTe- 
Ionding values of T， 

:JIn the output, g js the segmented image, with the members of each reglon 
ing labeled with an integer valiue. Parameter NR is the number of different 
gOns， Parameter SI is an image containing the seed points, and parameter TI 
n image containing the Pixels that passed the threshold test before they 
ere processed for connectivity. Both SI and TI are of the same Size as 于 . 
:The code for function regiongrow is as follows. Note the use of Chapter 9 
mnction bwmorph to reduce to l the number of connected seed points in each 
ion in S (when S is an array) and function imreconstruct to fnd Pixels 


nnected to each seed. 


10.4.2 Region Growing 


As lts name implies, regiom gromingisaprocedure that groups pixels or subl 
gons into larger regions based on predefined criteria for growth.The basica 
Proach ls to start with a Set of “seed”points and from these grow Teglons. by 
appending to each seed those neighboring pixels that have predefined pro 
ties similar to the seed (Such as Specific ranpges of gray level or color). 

9electing a set of one or more starting points often can be based on the Daz 
ture of the problem, as Shown later in Example 10.8. When a Priori inform 
tion is not available, one procedure is to compnute at eveTy Pixel the same set O 
PIOPertes that uitimately will be used to assign pixels to regions during the 
grOwWing piIocess. If the result of these computations shows clusters of values 
the PIXels whose Properties place them near the centrold of these clusters 5 
be used as Seeds. 

The selection of similarity criteria depends not only on the Problem under 
consideration, but also on the type of image data available. For example, the 
anajlysis of land-use satejjite imagery depends heavily on the use of color. This 
Problem would be significantly more difficult, or even impossible, to handle 
without the inherent information avaiiable in color images. When the images 
are monochrome, reglon analysis must be carried out with a set of descTiptors 
based on intensity levels (such as moments or texture) and spatial properties: 
We discuss descriptors useful for region characterization in Chapter 11， 

Descriptors alone can yield misleading results 让 connectivity (adjacency) 
Information is not used in the region-growing process. For example, vjsualize 3 
random arIrangement of pixels with only three distinct intensity values. Grou 
ing PIxels with the same intensity level to form a“region”without paying at- 
tention to connectivity would yield a segmentation result that is meaninglesS 
in the context of this discussion. 

Another Problem in region growing is the formulation of a stopping rule. 
Baslcally, growing a region Should stop when no more Pixels satisfy the criteIia. 
for inclusion in that region. Criteria such as intensity values, texture, and color 
are local in nature and do not take into account the“history”of region growth- 
Additional criteria that increase the power of a region-growing algorithm: 





nction [g，NR，SI，TI] = regiongrow(f，3，T) regiongrow 
35REGIONGROW Perform segmentation by region growing， WE 
[G，NR，SIT，TI] = REGIONGROW(F，SR，T). S can be an array (the 

sanme Size 3aS F) With a 1 at the coordinates of every Seed Point 

and 0Ss elsewhere. S can also be a Single Seed value，SinmiJlarly， 

T can be an array (the Same Size as F) contajining a threshoJld 

value for each pixel in F, T can also be a Scalar，in Which 

case jt becomes a global threshold， 
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true 8editibeierml io 
JogicalfT),crad 
false MegrivaiertL 
IO 10gical(0)， 


上 XAMPELE 10.8: 
Application of 
reglion grTOwing to 
weld porosity 
detection. 
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on the output，G is the Pesult of region growing，with each 
region Jabeled by a different integer，NR is the number of 
regions，SI is the final Seed ;image Used by the algorithm， and 了 
liS the image ConsSisting of the pixels in F that Satisfied the 
threshold test， 


f = double(f); 
% IfSaisascalar，obtain the Seed image， 
if numelL(S) == 
SI = 于 == 9S; 
3S1 = S; 
elSe 
%5sSisan array， ELimninate duplicate，connected seed locations 
5 to reduce the number of 100p executions in the following 
名 Sections of code ， 
SI = bwmorph(S， shrink ' ， 
J =TfTind(SI) ，; 
= 了 flJ)j 55Array of s$eed values. 


co oo oo do wo 


In ) ; 


end 
TI = false(Size( 了 ) ) ; 


forKkK= 1:1ength(St) 
seedvalue = S1(K) ; 
S3= abs(f 一 Seedvalue) <= T; 
TI=TI | S; 

end 


5%5 Use function imreconstruct With SI as the marker image to 

5 obtain the Pregions Corresponding to each seed in S，Function 
% bwlabel assigns a different integer to each connected reglon， 
[g9，NR] = bwJabel(imreconstPuct(SIT，TI) ) ; 
8-connected to at least one pixel in a region to be included in that region. 于 
Pixel is tound to be connected to more than one region, the regions are auto- 
inatically merged by regiongrow. 

了 gure 10.14(b) shows the seed points (image SI). They are numerous in 
his case because the seeds Were Specified Simpily as 3all points in the Image 
With a value of 255. Figure 10， 14(c) 1 Image TI. It shows all the points that 
passed the threshold test; that js，the Points with intensity zi，sSuch that 
- 3 引 三 7. Figure 10.14(d) shows the result of extracting all the pixels in 
gure 10.14(c) that were connected to the seed points. This is the segmented 
mage,g. It ls evldent by comparing thls image with the original that the region 
growing procedure did indeed Segment the defective welds with a reasonabjie 
degree of accuracy 

Finally we note by Ilooking at the histogram in Fig. 10.15 that it would not 
ave been possible to obtain the same or equivalent solution by any of the 
hresholding methods discussed in Section 10.3.The use of connectivity was a 


0 
要 


undamental requirement in this case， 泪 


半 Figure 10.14(a) shows an X-ray image of a weld (the horizontal dark re 
gon) contalning Several cracks and porosities (the bright, white streaks ru 
ning horizontally through the middle of the image), We wish to use function 
regiongrow to segment the regions corresponding to weld failures.These seg 
mented regions could be used for inspection,for inclusion in a database of his 
torical studies, for controlling an automated welding system, and for othel 
numerous appllications，. 

The first order of business is to determine the initial seed points. In this ap 
plication, it is known that some pixels in areas of defective welds tend to have 
the maximum aljlowable digital value (255 in this case). Based in this informa- 
tion, we let S = 255. The next step is to choose a threshold or threshold 3aTT37 
In this particular example we usedT = 65.This number was based on analysis 
of the histogram in Fig. 10.15 and represents the difference between 255 and 
the location of the first major valley to the left, which is representative of the 
highest intensity value in the dark weld reglon. As noted earlier, a pijxel has to 
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FIGURE 10.14 

(a) Image 
showing defectlive 
welds. (bj Seed 
Points.(c) Binary 
image Showing 3j] 
the Pixels (in 
whlte) that passed 
the threshold test. 
(d) Result after 
al the Pixels in 
{(c) were analyzed 
for 8-connectivity 
to the seed points，. 
(Original image 
coOurtesy of 入 - 

T 王 天 Systems， 
Ltd.) 
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FICURE 10.15 
Rilstogranml of 
Fig. 10.14(a)， 


ab: 


FICGURE 10.16 

(aj Partitioned 
image. 

(b) Corresponding 
quadtree. 











































Jas splitting. Satisfying the constraints of Section 10.4.1 requUlres merging only 
cent regions Whose cormbined Pixels Satisfy the predicate 已 Jhat ls, two adja- 
cenf Tegions 及 and Rk are merged only 直 PRIUR6 = TRUE 

The preceding discussion may be summarlized by the following Procedure 


hich, at any step， 


10000 


8000 
Split into four disjoint quadrants any Tegion Ri for which P(R) = 上 AL 上 . 
， When no further SPplitting ls possibje, merge any 二 reglons Ri and 
: Ri for which P(RiURG = TRUE. 

.Stop when no further merging 1S possible. 


6000 


4000 


Numerous variations of the Preceding basic theme are posslble. For exam- 
8 significant simpiification results 让 we allow merging of any two adjacent 
gions Riand 及 寺 each one satisfies the Predicate individually. TIhis resujts in 
uch Simpler [ai faster) algorithm because testing of the Predicate is limlt- 
to individual quadregions.As Example 10.9 shows,this simplification 1S Stll 
pable of yielding good segmentation results in Practice, Using thls apProach 
step 2 of the procedure, aH quadregions that satisfy the Predicate are fed 
也 1s and their connectivity can be easily examined using,for exampjle, func- 
n imreconstruct.This function, in effect, accompjlishes the desired merTg- 
of adjacent quadregions. The quadregions that do not satisfy the Predicate 
e filled with 0s to create a Segmented Image. 
The function in IPT for implementing quadtree decomposition 1s qtdecomp， 
esSyntax of interest in this Section 1S 


2000 





0 0 0 1 20 贡 
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tive js to Subdivide an image Initially into a Set of arbitrary， disjointed regions 
then merge andqd/or spljit the regions in an attempt to satisfy the conditions sta 忌 
in Section 10.4.1. The basics of sphtting and merging are discussed next. 

Let Rrepresent the entire image region and select a predicate 已 One approacF 
for Segmenting &lsto subdivide it Successively into smaller and smaller quadran 
Teglons so that, for any region 尺 ,，P(Ri) = TRUE. We start with the entire 
gon. 开刀 RN) = 下 ALSE, we divide the image into quadrants. 和 Pis FALSE far 
any quadrant, we subdivide that quadrant into subquadrants, and so on. This Pa 
ticujlar splitting technique has a convenient representation in the form of a $ 
cajljed 4gxadtree; that is, a tree in which each node has exactly four descendants; as 
ilustrated in Fig. 10.16 (the subimages corresponding to the nodes of a quadtree 
sometimes are called 9LGQG7eSiO105 OT GUI11108ey). Note that the root of the trees 总 
coOTTesponds to the entire image and that each node corresponds to the subdivision 
of anode into four descendant nodes. In this case, onjy R4 was subdivided further 

开 only spjitting is used, the final partition normally contains adjacent Te8l ODS 
with identical Properties. This drawback can be remedied by allowing merging, a8 


S = qtdecomp(f，Gsplit_ test，parameters ) 


ere f is the input image and S is a sparse matrix containing the quadtree 
tructure, If S(k, m) is nonzero,then (k, m) js the UPpper-jeft corner of a block Im 
the decomposition and the size of the block js S(k, m). Function SPp1Lit_test 
See function spLitmerge below for an exampie) is used to determine whether 
egion is to be split or not, and parameters are any additional Parameters 
separated by commas) required by split_test.The mechanics of this are sim- 
rto those discussed in Section 3.4.2 for function CoLtfilt， 

To get the actual quadregion pixel values in a quadtree decomposition we 
se function dtgetb1Lk, with Syntax 


[vals，r，CcC] = qtgetblk(f，S，fnh) 


yhere vals is an array containing the values of the blocks of size mxminthe 
uadtree _ decomposition of f，and S is the Sparse Imatrix returned by 
tdecomp. Parameters Fr and c are vectors containing the row and coiumn co- 
rdinates of the upper-left corners of the blocks. 

We jllustrate the use of function qtdecomp by writing a basic RN 
merge M-function that uses the simplification discussed earler, In which two 
egions are merged if each satisfies the Predicate indivlidually [pe function， 
hich we call splLitmerge,has the following calljing SyntaxX: 


g = SpLitmerge(f，mindim，epredicate |) 
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Oriner Jorpzy or 
qtdecomp are dy- 
CHSSedl 1 

9ectiom 7.2.2. 
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FLAG = PREDICATE(REGION) which must return TRUE ji the pixels 
in REGION satisfy the predicate defined by the code in the 
functioni otherwise，the value of FLAG must be FALSE. 


where f is the input image and g is the output Image in which each connec 
Teglon ls jabeled with a different integer. Parameter mindim defines the S 这 
the Smallest block allowed in the decomposition; this parameter has to 
Positive integer Power of 2. 

Function predicate is a user-defined function that must be included in 
MAILAB path. Its syntax js 


The following Simple example of function PREDICATE is used in 
Exampjle 10.9 of the book， It SetSs FLAG to TRUE if the 
jintensities Of the pixelS in REGION have 3a standard deviation 
hat exceeds 10，and their mean intensity is between 0 and 125， 


1:ag = predicate(region) therwise FLAG is set to false， 


This function mnust be written so that it returns true (a logical 1) 计 the p 这 
in region satisfy the predicate defined by the code in the function; otherwi 
the value of flag must be false (a logical 0). 上 xample 10.9 illustrates the 
of this function. 

Function spLitmerge has a simple structure. First the Image is partition 
using function gqtdecomp. Function Split_testuses predicate to determij 
让 a region should be split or not. Because when a region is Spjit into four 
not known which (if any) of the resulting four Tegions Will pass the predi 
test Individually, it is necessary to examine the Teglons after the fact to 
whbich regions in the partitioned image Pass the test. Function predicat 
used for this Purpose also. Any quadregion that passes the test is filled wit ]$ 


function fl1ag = predicatetregion) 

sd = std2(region); 

mi = mean2(region) ; 

flag= (Sd> 10)& (mm>0)& (nm< 125)| 


ad image With zeros to guarantee that function qtdecomp will 
plit regions down to size 1-by-1， 

2^neXxtpow2 (max(Size(f))); 

RN] = Sizel(f)， 

padarray(f，[Q 一 NM Q 一 NI，'post')， 


rform Sp]litting 于 irst， 
qtdecomp(f，Qsplit _ test，mindim，funy) ， 


Now merge by jlooking at each quadregion and setting all its 
elementSs to 1 if the block satisfies the predilcate ， 


Get the Size of the largest block，Use fu1l1 because S is Sparse， 
9X = TUlLL(max(S(:)))， 
the Size of the inpait image is brought up to a square whose dimensions are tb 必 es NI 人 
、 本 zer0SsS(Size(f) ); 
ment of function qtdecomp to guarantee that spjlits down to size 1 are possibl MARKER = zeros(sizel(f)); 
5 Begin the merging Stage， 
or K = 1:Lmax 
[vals，P，Cc] = qtgetblk(f，S，K) ; 
if ~isempty(vals) 
s heck the predicate for each of the regions 
% of Size K-by-K with coordinates given by vectors 
生 让 and c， 
for I = 1:1engthfr) 
Xlow = 『f(I); ylow = CU(I); 
xhligh = xlow +K-- 1; yhigh = ylow + K-= 1; 
region = f(xLow:xhigh，y1Low':yhigh) ， 


splLitmerge function g = Spjiitmerge(f，mindim，fun) 

Ce %SPLITMERGE Segment an image using 8 split-and-merge algorithm， 
5 6 = SPLITMERGE(F，MINDIM， GPREDICATE) Segments image F by Using 

%5。 Split-and-merge approach based on quadtree decomposition，MINDIW 

名  (a positive integer power of 2) specifies the minimum dimension 

#% of the quadtree regions (Subimages) allowed， If neceSSary，the 

名 program pads the input image With zeros to the nearest Square 

点 Size that is an integer power of 2. This guarantees that the 

salgorithm used in the quadtree decomposition will be able to 

名 Split the ;image down to blocks of size 1-by-1.。 The result is 

5 cropped back to the original Size of the input image，In the 

和 

具 

务 

咎 


flag = feval(fun，region)， -8Val 
Output， 6，each connected region is labeled with a different if fl1ag 本 
0 eg， g(xlow:xhigh，ylow:yhigh) = 1 feval(fun ， 

MARKER(X1LOWw，yYILow) = 1 下 Actt 
Note that in the function call we use epPREDICATE for the value of end 人 ee 
fun，PREDICATE is a function in the MATLAB path，provided by the- end PQ7Q1TeLer Paranm， 
USer，TItSs Syntax 1IS end See te Aheip page 记 r 
feval jor ornersyHn- 


ix 1jorptyaDPEcapbie 
1 雪上 首 PCIEOH 
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Finally，obtain each connected region and 1abel it With 3 
different integer value Using function bwlabel， 
= bwjlabel(imreconstruct(MARKER，g))， 


Crop and exit 
= g(1: 刘 ，1:N); 


%- -0 交 
function V = Split test{(B，mindim，funy) 

5 THIS _ FUNCTION IS PART OF FUNCTION SPLIT-MERGE. IT DETERMINES 

s% WHETHER QUADREGIONS ARE SPLIT，Tphe function returns in V 

5%5 0gical 1S (TRUE) for the blocks that Should be Split and 

%5 10gical 0s (FALSE) for those that should not， 





oo o 避 





下 开 





















( 恨 


oQ 


CC 


5 Quadregion B，passed by qtdecomp，is the current decomposition of 
多 the image into K blocks of Size m-by-m， 
% KK is the number of regions in B at this point in the procedure. 
k = Size(B，3); 
5 Perform the Split test on each block. If the predicate function 
5 (fun) returns TRUE，the region is Split，So We set the appropriate 
%5 element of VvV to TRUE， E1sSe，the appropriate element of vV is Set to: 
%5 FALSE， 
V(1:K) = false; 
for 工 = 1:K 
quadregion = B(:，:，TIT); 
lf Size(quadregion，1T) <= mindinm 
V(I) = falSe; 
Continue 
= feval(fun，quadregion) ; IGURE 10.17 Image segmentation by a split-and-merge procedure. (3a) Original ee 癌 0 
if 了 1ag :Tesults of Segmentation using function splLitmerge with values of mindim equal to 3 an 
v(I) = true; espectively (Original Image courtesy of NASA.) 
end 光 
end 人 Figures 10.17(b) through (人 show the results of segmenting Fig, 10.17(aj 


Sing function spLitmerge With mindinm vajues of 32, 10, 8, 4， and 2 Tespec- 


EXAMPLE 10.9 时 Fgure 10.17(a) shows an X-ray band image of the Cygnus Loop.The image: 
Image is of size 236 X 256 Pixels.The objective of this example is to segment out of : 
ee the image the“ring”of less dense matter surrounding the dense center. There 
nl gon of interest has Some obvious characteristics that should help in its Se8g 
merpging. mentation. First, we note that the data has a Iandom nature to it, indicating: 
that lts standard deviation Should be greater than the standard deviation of 
the background (which is 0) and of the large central region. Similarly the 
mean value (average intensity) of a region containing data 他 om the outer Ting 
should be greater than the mean of the background (which is 0) and less than- 
the mean of the large, lighter central region, Thus, we should be abje to 568 
ment the region of interest by using these two parameters. In fact, the predi- 
cate function shown as an example in the documentation of functon 
Splitmerge contalns this knowledge about the Problem.The parameters Were 
determined by computing the mean and standard deviation of various TegionS 
in Fig. 10.17(a). 


ersely proportional to the value of mindim， 
.Allresults in Fig.10.17 are reasonable Segmentations, If one of these Images 
ere to be used as a mask to extract the region of interest out of the original 


rated is its abjlity to“capture”ipn function predicate information about a 
roblem domain that can help in segmentation， 潮 


n geography,a watershnedis theridge that divides areas drained by different TiverT 
ystems.A catcprmaei pasz is the geographical area draining into a river Or TeSeT- 
Yoir.The materspned transfprna applies these ideas to gray-scale image PIOocessing 
ina way that can be used to solve a variety of image segmentation Problems. 
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FIGUR5 10.18 
(aj Gray-scajle 


image of dark biobs. 
(b) Image viewed as 


aSurface, with 
labeled watershed 
Tidsge line and 
catchment basins. 


:bwdist 


3a. hb 


FIGURE 10.19 
(a) Small binary 
Imapge. 

(b) Distance 
transform， 
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Catchment basins 



















Understanding the watershed transform requires that we think of a 红 直 
scaje image as a topological surface, where the values of F(x， y) are interpr 
ed as heights. We can, for example, visualize the simple image in Fig. 10.18 
as the three-dimensional surface in Fig. 10.18(b). If we Imagine Tain falling 
thls Surface, it is clear that water would collect in the two areag labeled S 
catchment basins, Rain falling exactly on the labeled watershed ridge ines 
would be equally likely to collect in either of the two catchment basins 卫 
watershed transform finds the catchment basins and Tidge lines in a gray-scal 
Image. In terms of solving image segmentation problemas, the key concept js-te 
change the starting image into another image whose catchment basins are t 
objects or regions we want to identify. 和 

Methods for compnuting the watershed transform are discussed in detail 证 ， 
Gonzalez and Woods [2002] and in Soille [2003]. In particular, the algorithim 
used in IPT is adapted from Vincent and Soille [1991]. 


5 让 六 让 


14.3.1 Watershed Segmentation Using the Distance Transform 













nOnZero-valued pixel. Figure 10.19 illustrates the distance transform Figure 
10.19(a) shows a small binary image matrix。 Figure 10.19(b) shows the corre- 
sponding distance transform. Note that 1-valued pixels have a distance trans- 
form value of 0. The distance transform can be computed using IPT function 
bwdist, whose calling Syntax is 


D = bwdist(ff) 
妨 In this example we show how the distance transform can be used with IPTIs 
Watershed transform to segment circular blobs, some of which are touching 
ach other. Specifically we want to segment the PITeprocessed dowel image, f， 
[1L 1 0 0 0 0.00 0.00 1.00 2.00 3.00 :Shown in Figure 9.29(b).First, we convert the image to binary using im2bw and 
L 1 0 0 00 0.00 0.00 1.00 200 300 raythresh, as described in Section 10.3.1. 


0 0 0 0 1.00 1.00 1.41 2.00 2.24 


>>g9= im2bw(f，graythreshf(f) ); 
1.41 1.00 1.00 100 1.41 
了 Rigure 10.20(a) shows the result. The next Steps arTe to complement thbe 1mage， 


0 compute its distance transform, and then compute the watershed transform of 
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FICURE 10.20 

(alj] Binary imasge. 
(b) Complement 
of image in (3a). 
(c) Distance 
transform. 

(d) Watershed 
ridpe lines of the 
negatlive of the 
distance 
transform， 

(e) Watershed 
rldge lines 
Superlimposed in 
black over 
Original binary 
Image. Some 
OverISsegnientation 
IsS evident. 


下 XAMPLE 10.10: 
Segrmenting a 
binary Image 
USling the distance 
and watershed 
transforms，. 
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WarShed 


Ar 
Sm， 二 






正和 AMPLE 10.11: 
Segmenting 3 
gray-Scale Image 
using gradients 
and the watetshed 
transform. 


















the negative of the distance transform, Using function Watershed. The callini 
syntax for this function 1S 


L = WaterSshed (ff ) 


whereLis alabel matrix, as defined and discussed in Section 9.4. Posltive int 
gers in L correspond to catchment basins, and zero vajlues lndlcate watershe' 
Tidge Pixejls. 


>> gc = -9 

>> D = bwdist(g9c) ; 

>> L = Watershed(-D) ; 
>> 风 = 二 


Figures 10.20(b) and (c) show the complemented image and its distance trans: 
form. Since 0-valued pixels of L are wateIshed ridge pixels, the last line of the 
preceding code compnutes a binary image, w, that shows only these Pixels. Tti 
watershed ridge image is shown in Fig. 10.20(d). Finally, a logical AND of th 
orIiginal binary image and the complement of w Serves to complete the seg 
mentation, as Shown in Fig.10.20(e). 8 


>> g2 = g & -Wi 


Note that some objects in Fig. 10.20(e) were split imnproperly. Ihis ls cajlled 
oOVersegPtettation and is a common Problem with watershed-based Segmenta 
tion methods.The next two sections discuss different technlques for overcom 
ing this difficulty. 


10.3.2 Watershed Segmentation Using Cradients 


The gradient magnitude is used often to preprocess a gray-scale image PTior to 


using the watershed transform for Segmentation. The gradient magnitude 
image has high pixel values along object edges, and low PiIXxel values eVeIy 





where else. Ideally, then, the watershed transform would resujt in watershed 


ridge lines along object edges. The next example illustrates thls concept. 


里 Figure 10.21(a) shows an image, f, containing Several dark blobs. We Start 
by compnuting its gradient magnitude, using either the linear filtering methods 
described in Section 10.1, or using a morphological gradient as described in 


Section 9.0.1. 


>> h = fspecial( Sobel ) |; 

>> fd = double(f) ; 

>>g= Sqrt(imfilter(fd，h， 
imfiltertfd，h' ， 


Feplicate ) ,“ 2 + -.. 
'PepLicate  ) 


Figure 10.21(b) shows the gradient magnitude image, g. Next we compnute the 


watershed transform of the gradient and find the watershed ridge lines，. 


>> L = Watershed(g) ; 
>> Wr = L== 0; 


IT 


芭 


一 所 

局 < 是 js 二 下 

有 上 吕 

Ra El 

于 

As Fig. 10.21(c) shows, this is not a good segmentation TesSult; there are too 

many Watershed ridge lines that do not coIrespond to the objects in whlich we 

are interested,This is another example of oversegmentation. One approach to 

this problem is to smooth the gradient image before computing its wateTshed 


transform., Here we use a close-opening, as described in Chapter 9. 


g2 = imcLlose(imopen(g，ones(3,3))，ones(3;3))1 
L2 = Watershed(g2) 1 

wr2 = L2 == 0 

f2 = 隔 ; 


f2(wr2) = 255 


The last two lines in the preceding code Superimmpose the watershed ridgelines 
in wr as white lines in the original image. Figure 10.21(d) shows the Superlim- 
posed result.Although improvement over Fig. 10.21(c) was achieved,there are 
still some extraneous ridge lines, and it can be difficult to determine which 
catchment basins are actually associated with the objects of interest. [he next 
section describes further refinements of watershed-based segmentation that 
deal with these difficulties， 髓 
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FIGURE 10.21 

(a) Gray-scale 
image of small 
blobs. (b) Graditent 
Imagnitude image. 
(c) Watershed 
transform of (b)， 
showing Severe 
overIsegmentation， 
(d) Watershed 
transform of the 
Smoothed gradient 
Image; SOIme 
oveIsegmentation 
is Still evlident， 
(Original image 
COUrtesy of DT. >. 
Beucher， 
CMIMVEcole de 
Mines de Paris.) 
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10.5.3 Marker-Controlled Watershed Segmentation 


Direct application of the watershed transform to a gradient lmage usualiy， 
leads to oversegmentation due to noise and other local irregularities of 有 
gradient. The resulting Problems can be Serious enough to ITender the resu 
virtually useless. In the context of the present discusslon， this means 3 1arge 
number of segmented regions. A Practical solution to this problem ls to linii 
the number of allowable regions by incorporating a PrIeprocessing Stage dess 
signed to bring additional knowledge into tbe Segmentation procedure. 
An approach used to control ovyersegmentation 1S based on the concept 呈 


下 [MA 
三 树 


册 
7 放 
下 


markers.A JarKeris aconnected component belonging to an image. Ye wot RN SEE | NS 
和 ， 导 TELE TREE 不 
like to have a set of interaal markers which are inside each of the objects of 也 yy 让 


terest, as well as a set of exterzal markers, which are cohntained wlthm the bacK 
ground, These markers are then used to modify the gradient Image sin 
procedure described in Exampie 10.12. Various methods have been used f 
computing internal and external markers, mmany of which involve the linear 全 
tering, nonlinear filtering, and morphological processIng described in Prevloss 
chapters. Which method we choose for a Particujar apPlication is highly depeiE 
dent on the specific nature of the images associated with that applcation， 


FXAMPLE 10.12: 澧 This example applies markercontrolled watershed segmentation to 也 
Ilustration of electrophoresis gel image in Figure 10.22(a). We start by considering ther 
markercontrolled sults obtained from computing the watershed transform of the gradient imag 


watershed ith h 
segmentation. WwWlithout any oOtneT PITOCeSSInS. 
>> h = fspeclial(' Sobel ) 
>> fd = double(f); 
>> gg = sqrt(imfilter(fd，h，'repiicate ) ”2 + 
imfilter(fd，h'， replicate ) .” 2) 
>> |L = Watershed(g) 
>> WP = 芋 艺 DO 
We can see in Fig. 10.22(b) that the result is severely oveIsegmented， duei 
part to the large number of regional minima. [二 function imregionalnmzinc 绽 
computes the location of all regional minima in an image. Its calang SyntaXx 1 
-negionalmin rm = imregionalmin(f) 


where fis a gray-scale image and rmis a binary image whose foreground Pix- 
els mark the locations of regional minima. We can use imregionalmin On the 
gradient image to see why the watershed function produces SO many sInal 
catchment basins: 


10.22 (a Gel image.(b) GOYVersegmentation Tesujlting from applying the watershed transform to the 
gradlent magnitude image. (c) Regional minima of gradieat magnitude. (d) Internal markers. (e) External 


markers. (人 Modified gradient mapsnitude. Segmentation result，(Origtnal i 
于 Beucher， CMIMVEcole des Mines de Paris.) ee 和 


>> rm = imregionalmin(g) 


Most of the regional minima locations Shown in Fig. 10.22(c) are VeIY sh 引 - 
low and represent detail that is irrelevant to our segmentation Problem, 1 
eliminate these extraneous minima we use IPT function imextendedmlim， 
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whlch computes the set of “ijow spots” in the image that are deeper (by a 
tain height threshold) than their immediate surroundings. (See Soille [200: 
tor a detailed explanation of the extemded 7zazipzpaec Parmrsjorpz and related op 


ations.) IJhe calling syntax for this function is Shed transform of the markermodified gradient image and look at the result- 


nig watershed ridgelines: 


sf 寂 、 


6 es 。 。 4 
tndednin im = imextendedmin(f，h) 
8 L2 = Watershed(g21) ; 


[和 
凤 最 


f2 = 和 


where fis a gray-scale jimage,h is the height threshold,and imisa binary f2(L2 == 0) = 255; 
SR 


whose toreground pixels mark the locations of the deep regional minima ] 


we use function imextendedmin to obtain our set of internal ma . . 1 ， 
We le ]ast two jines Supefimnpose the watershed ridge lines on the original image. 


， ， - ved Segmentation, ls Shown in Fig,. 10.22(g). 误 
>> im = imextendedmin(f，2)， ie result, a much-improved seg 8 (8) 
>> fim = 于 |; 


>> fim(Iim) = 175; Marker selection can range from the simple procedures just described to 


nsiderably more complex methods involving Size, Shape, location, relative 
ances, texture content, and So on (See Chapter fl regarding descriptors)， 
fhe polint Is that Using markers brings a prIiori Knowledge to bear on the seg- 
mentation problem. Humans often aid segmentation and higher-level tasks in 
eryday Vlislon by Using a PTiori knowledge,one ofthe most familiar being the 
ise of context, Thus, the fact that Segmentation by watersheds offers a frame- 
ok that can make eftective use of thls type of knowledge is a significant ad- 
ntage of thlis method. 


The last two lines superimpose the extended minima locations as grTay 
on the original image, as shown in Fig. 10.22(d). We see that the resulting blo 
do a Ieasonably good job of “marking”the objects we want to Segment.， 
Next we mnust find external markers, or pixels that we are confident belors 
to the background,.The approach we follow here js to mark the backgroun 
tnding pixels that are exactly midway between the internal markers. SUIP 
ingly we do this by solving another watershed problem， Specifically, we co 
pute the watershed transform of the distance transform of the internal maK 
Image, Im: 





image segmentation ls an essential preliimninary step in most automatic pictorial pat- 
ern-recognition and scene analysis problems. As indicated by the range of exarmples 
Tesented in this chapter,the choice of one segmentation technique over another is dic- 
ated Imostly by the particular characteristics of the problem being considered,. The 
nethods discussed in this chapter, although far from exhaustive, are representative of 
echniqtues used commonjly in practice. 


>> Limn = WaterSshed(bwdist(im)); 
>> em = Lim == 0; 


Figure 10.22(e) shows the resulting watershed ridge lines in the binary image 
em. since these Tidgelines are midway in between the dark blobs marked by im 
they should serve well as our external markers. 

lven both internal and external markers, we use them now to modify 也 
gradient image Using a prfocedure called maipipaa iazposition. The minima lm: 
Position technique (see Soille [2003] for details) modifies a gray-scale image 
s9 that regional minima occur only in marked locations. Other pixel values are 
“pushed up”as necessary to remove all other regional minima. IPT functionsse 
Imimposemin impljements this technique. Its calling syntaxX is 


0 
中 / 凡 人 
| 


-minposemin mp = imimposemin(f，mask) 
CN 





where f is a gray-scale image and mask is a binary image whose foreground: 
PIXels mark the desired locations of regional minima in the output imagen 
wemodify the gradient image by imposing regional minima at the locations 呈 
both the internal and the external mmarkers: 
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rom the definition given lin the Previous paragTaph, 让 follows that a 
undary 1S a connected set of points. The points on a boundary are Salid to be 
iered 让 they form a clockwise or counterciockwise sequence.A boundary is 
刘 to be ipipaaliy comnmected 过 each of its points has exactly two 1-valued 
De ighbors that are not 4-adjacent. An inaterior poinil is defined as a point any- 
Bre in a Teglon, except on its boundary. 

The material in this chapter differs significantly from the discussions thus 
n the Sense that we have to be able to handle a mixture of different tyPes 
data Such as boundatries, regions, topological data, and so forth.Thus, before 
oceeding, we pause briefly to Imtroduce some basic MATLAB and IPT con- 
pts and functions for use later in the chapter. 


,1.1 Cell Arrays and Structures 


begin with a discusslion of MATLAB's cell arrays and structures,， which 
re introduced briefly in Section 2.10.6. 


1] Arrays 

1] arrays Provide a way to combine a mixed set of objects (e.g., nuUmbers， 
Haracters, matrices, other cell arrays) Under one Variable name. For example， 
uppose that we are working with (1) an uint8 image, f,ofsize S512 X 512; (2) 
equence of 2-D coordinates in the form of rows of a 188 x 2 array, b; and 
3) a cejl arTay containing two character names, char array = {farea'， 
centroid }. JIhese three dissimilar entities can be organized into a single 
arlable, C, using cejll arrays: 


Peiez0 


After an Image has been segmented into regions by methods such as those di 
cussed in Chapter 10, the next step usually is to represent and describe the afg 
gregate of segmented “Taw”Ppixels in a form suitable for further compnute 
Processlng. Representing a Tegion inyolves two basic choices: (1) We can Tep 
Iesent the Teglon in termas of its external characteristics (its boundary), or (2 
We can Iepresent it in terms of its internal characteristics (the pixels compris 
ing the region). Choosing a representation scheme, however is only part of 也 
task of making the data useful to a compnuter. The next task is to descripe 也 
reglon based on the chosen representation. For exaImpjle, a region may b 
repreyented by its boundary, and the boundary may be descriped by feature 
Such as lts length and the number of concavities it contains. 和 


= 人 { 人 fb char array} 


here the curly braces designate the contents of the cell array.Typing C at the 
rompt would output the following results: 


gional properties, such as color and texture. Both types of rePresentations- 
sometimes are used in the same application to solve a problem. In either case 
the features selected as descriptors should be as insensitive as possible to vari 
ations in region size, translation, and rotation, For the most part, the descrip 和 escription of some of their properties instead.To address the complete con- 
tors discussed in this chapter Satisfy one or more of these PIOoperties， tents of an element of the cell, we enclose the numerical location of that ele- 


[512x512 Uint8] 1488x2 doub1e] {1X2 Cel1+} 





Background 


Aregionm is a connected component, and the poumdary (also called the porder or 
coOmiour) of a region is the set of pixels in the region that have one or more neigh- 
bors that are not ip the region. Points not on a boundary or region are called 
packgrourd points. Initially we are interested only in binary images, SO Tegion OF : 
boundary points are represented by ls and background points by 0s. Later in this - 
chapter we allow Pixels to have gray-scale or multispectral values. 


area' ' centroid' 


or we can use function celldisp: 


11.1 机 Background 427 


428 ”Chapier 11 查 Representation and Description 11.1 小 ES 429 



















和 本 Suppose that we want to write a function that outputs the averTage intenslty ”EXAMPLE 11.1: 


fan image, its dimensions, the average intensity of its rows, and the average 全 Simple 





ssGeEldisp 
TS ans{f{1} 
一 二 tensity of its columns. We can do it in the“standard”way by writing a func- Re ee 
(2) on of the form 
ans 
centroid Unction [AI，dim，AIrows，AIcolS] = image_stats(Tf) 


襄 = Size(f)j 
f = mean2() ; 
owWS = meant(f，2); 


Using parentheses instead of curly braces on an element of C gives a descr 
TIcols = mean(f，1); 


tion of the variable, as above: 


0 here f is the input image and the output variables correspond to the quanti- 
ans = es just mentioned, Using cejlls arrays, we would write 
{1XxX2 Cel1} nction G = image stats(Tf) 
= SIze(T) ; 
We can work with specified contents of a cell array by transferring them ft = mean2 (了 ) ; 
numeric or other Pertinent from of array. For instance,to extract f foma C we: = mean(T，<) 
忆 = mean(fT，1); 


>> 千 = Cf{ 人 Ti ; ， 
riting G(1) = {size(f)},and similarly for the other terms, also 1S accept- 


e. Cell arrays can be multidimensional. For instance, the PITevious function 


Function size gives the size of a cell array: Te ssess 


>> Sizel(C) unction H = image_ statS2(Tf) 
ans = 1) = {Size(f)]}; 
1 3 H(1，2) = {mean2(T)}; 
HG2，1) = {fmean(f， ee 
1(2，2) = {mean(f，T)}; 
Function celLIfun, with Syntax 
光 Or we collld have used H{f1,1} = Size(f),andsoonftorthe other varlables 
D = Cellfun( 'fname' ，C) 全。 Additional dimensions are handjed in a similar manner. 





See ipe cellfun applies the function fname to the elements of cell array C and returns the Te- 
0 aistojf suits in the double array D. Each element of D contains the value returned by 
人 fname for the corresponding element in C.The output array D is the same size 

as the cell array C. For example， ee 


>> G = image stats(f) 
image_statsS2 (下 ) ; 


V 
V 

工 
由 


>> D = Cellifun('1length' ，C) 
[1x2 doub1le] [1 ] [512x1 doub1e] [1x512 doub1le] 
512 188 2 


in other words, length(f) =512,1ength(b)=188and length(char_ array) = 
Recall from Section 2.10.3 that length(A) gives the size of the longest dimension 
of a multtdimensional array A. 

Finajy keep in mind the comment made in Section 2.10.6 that cell arTay5 
contalin copies of the argumients, not pointers to those arguments. Thus, 让 any 
of the arguments of C in the preceding example were to change after C was ce， 
ated,that change would not be reflected in C. 


[ 1x2 doub1e] [ 1 ] 
[512xt doubl1e] [1x512 doubl1e ] 





fwe want to work with any of the variables contained in G, we extract it by ad- 
dressing a Specific element of the cell array, as before. For instance, 过 we want 


0 work with the Size of f, we Wiite 
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人 sizel(sS) 
OT 二 

] 
>>V= H{f1;1+1+ 


Note that s itsejf ls a scalar, with four fieids associated with it in this case， 

:We see in this exampjie that the logic of the code is the same as before, bnut 
ie organlzation of the output data is much clearer. As in the case of cell ar- 
s the advantage of using Structures would become even more evident 过 we 
ere deajling with a larger number of outpnuts. 图 


where visal Xx2vector.Note that we did not use the familiar command [有 
N] = Gf1} toobtaimn the size ofthe image.This would cause an error becaug 
only functions can produce mnultiple outputs. To obtain M and N we would Us 
M=v(1)andN = v(2). 



























The economy of notation evident in the preceding example becomes eye 
more Obvious when the number of outputs is large. One drawback is the loss 0| 
clarity in the use of numerical addressing, as opposed to asSsigning names tao 
the outputs. Using structures helps in this regard. 


"The Preceding iustration used a single structure. If instead of one image， 
ehad CO images organized in the form ofan MXxNx @C array the function 
ould become 


Unction s = image_stats(f) 


9tructures 5 
517UCtures are Similar to cell arrays in the sense that they allow grouping ofa or K = 1 :K(3 ) 


collection of dissimiilar data into a single variable. However, unlike cell arTayS 
where cells are addressed by numbers, the elements of structures are ad- 
dressed by names called Fields. 


S(k),dimn = Size(f(:，:，Kk 
sf(k).AI = mean2(f(:，:，k 
sS(K),AIrows = mean(lf(:，: 
S(k).AICcols = mean(f(:，: 
EXAAMPIE 12 村 Continuing with the theme of Example 11.1 w 记 clarify these concepts， 


A simple Using Structures, we Write 
lllustration of 


we function s = image_ stats(f) 


S.dim = Size(f); 
Ss.AI = mean2( 和 下 ) ; 
sS.AIrowS = mean(f，2); 
S.ALICcols = mean(f，1); 


ys, Structures can have any number of dimnensions, their most common form 
a Vector as In the Preceding function. 

”了 xtracting data from a field requires that the dimensions of both s and the 
field be keptin mind.For exXample, the following statement extracts all the val- 
ues of AIrows and stores them in v: 


了? 


where s is a structure. The fields of the structure in this case are AI (a. 
SCalar)， dim (a  X 2 vector), AIrows (an M X 1 vectorj, and AIcols (a 
1 X MY vector), where M and N are the number of rows and columns of the: 
Image. Note the use of a dot to separate the structure from its various fields. 
The field names are arbitrary，but they must begin with a nonnumeric: 
character. 

Using the same image as in Example 11.1 and typing s and size(s) atthe 
PIOmpPt gives the following outpnut: 


v(:，k) = S(k).AIFrows; 


Note that the colon is in the first dimension of v and that kis in the second because 
iof dimension 1 x CandAIrowsis of dimension M X O.Thus becausek goes 
om 1 to CO,vis of dimension M X O. Had we been interested in extracting the 
alues of AIcols instead, we would have used v (kK，: ) im the loop， 

Square brackets can be used to extract the information into a vector or ma- 
Mix 这 the field of a structure contains scalars、 For exXampje，Suppose that 
'Area contains the area of each of 20 regions in an Image. Writing 





>> S = 
S 二 
dim: [5t+2 512] 
AI: 1 
AIFowS: [512xl doub1e] 
AIcColS: [1x512 double] 


W=[D.Area]; 


eates a 工 X 20 vector w in which each elements is the area of one of the 
Teglons. 
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The 2-D coordirates of regions or boundaries are organized in this chapter 
the form ofzpPp X2 arIayS， where each IOwW is an (X, y) coordinate Palr， and 
hp is the number of points in the region or boundary. In some cases it is neces- 
ary to Sort these aITays. Function sortrows can be used for this Purpose: 


As with ceil arrays, when a value is assigned to a Structure field, MATLAR 
makes a copy of that value in the structure. 寺 the original value is changed at a 
later ttme, the change is not reflected in the Structure. 


1 SomeAdditional MATLAB and IPT Functions Used in Thi 
Chapter 


Function imfil1 was mentioned briefly in Jable 9.3 and in Section 9.5.2. This 
function performs ditterently for binary and Intensity image inputs, so, to heip 
clarify the notation in thls Section, we let fB and fI repiresent binary and i 
tenslty 1mages, Tespectively. 坟 the outpnut is a binary image, we denote it by 9 
otherwise we denote Simply as g. The syntax 


Z = Sortrows(S) 


is function Sorts the rows of S in ascending order.Argument S must be either 
atrix or a column vector. In this chapter, sortrows is used only with mpP X 2 
ys. 开 Several TowWSs have identical first coordinates,they are Sorted in ascend- 
ing order of the Second coordinate. If we want to sort the rows of S and also 
Jiminate duplicate Tows, we use function unique, wbich has the syntax 

gB = imfill(fB8，1Locations，Cconn) 
[zZ， mm nl = unique(S，“' rows ') 
performs a food-fill operation on background Pixels (i.e., it changes bac 
ground plxels to 1) of the input binary image fB, starting from the Points spe 
ified in Jocations.TIhis parameter can be an7m X1vector (nisthenumber of 
locations), in which case it contains the /iear ipmdices (See Section 2.8.2) of 世 
starting coordinate locations. Parameter 10cations can also be an1 X 2Dm 
tTrix, In which each row contains the 2-D coordinates of one of the starting 
cations in fB, Parameter Conn Specifies the connectiyity to be used on the 
background pixels: 4 (the default), or 8. If both location and conn are omi 
ted from the input argument, the command g8 = imfilL1(fB) displays theb 
nary Image, fB, on the Screen and lets the user select the starting locations 
using the mouse. Click the left mouse button to add Points. Press BackSpaceo 
Dejlete to remove the previously selected point. A shift-click, right-cjick, o 
double-click selects a final point and then starts the 了 fill operation. Pressin 
Return fnlshes the Selection without adding a point. 

Using the Syntax 


where z js the Sorted array with no dupjlicate rows, anadmand n are such that 
St(m，:)andS=z(n，:).Forexample,ifS=[12;65;12; 43],then 
[1 2;4 3;6 5],m= [3; 4j】 2j,andn=[1; 3; 1; 21.Note that zjis 

anged in ascending order and that m indicates which rows of the original 

ay Were Kept. 

Frequently, it ls necessary to Shift the rows of an array up, down, or Sideways 

pecified number of positions. For this we use function circshift: 


Z = Circshift(S，[ud 1Lr]) 


vbere ud is the number of elements by which S is shifted up or down. If ud is 
sitive, the Shift js downi otherwise it is up.Similarly,if Lris Positive,the array 
hiftted to the Tight Ir elements; otherwise it is shifted to the left. If only up 
ddowa shifting is needed, we can use a Simpler Syntax 

g8 = imfill(fB，conn， 'holes ') zZ = Circshift(S，ud ) 

S 1s an image, circshift js really nothing more than the familiar scrolzzag 


fills holes in the input binary image.A HAole is a set of background pixeis that 
up and down) or Partipng (right and left), with the image wrapping around， 


cannot be reached by filling the background from the edge of the image, As 
before, conn speclfies connectivity: 4 (the default) or 8. 


了 he syntax 1.1.3 Some Basic Utility M-Functions 


asks Such as converting between regions and boundaries, ordering boundary 
omts in a contiguous chain of coordinates, and Subsampling a boundary to 
mplify its Tfepresentation and description are typical of the processes that are 
ployed ronutinely in this chapter. The following utility M-functions are used 
these Purposes. To avoid a loss of focus on the main topic of this chapter， 
e discuss only the syntax of these functions, The documented code for each 
on-MAILAB function is included ip Appendix C, As noted earlier, bound- 
arles are represented as AD X 2 arrays in which each row Tepresents a2-D pair 
coordinates. Many of these functions automatically convert 2 X HP coordi- 
ate arTays to arTays of Size 2PD X 2. 


g= imfilLi(fI，conn，'holes ') 


fills hotles in an input intensity image, fI. In this case, a hole is an area of dark. 
Pixejls SurTounded by hghter plixels. Parameter conn is as before. 

Function find can be used in conjunction with bwJlabeltoreturn vectoIS Of 
coordinates for the pixejs that make up a Specific object, For example, 让 [gB， 
num] = bwlabelL(fB) yjields more than one connected region (ie., num > 1) We- 
obtain the coordinates of Say, the Second reglon Using 


[r，c] = find(g == 2) 
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is function inserts new boundary Pixels wherever there is a diagonal con- 





ee hection， thus Producing an output boundary in which Pixels are only 4- 
肖 疯 nnected, Code listings for both functions can be found in Appendix C. 
es B = boundaries(f，conn，di Function 
traces the exterior boundaries of the objects in f, which ls asSsumed to be a g = bound2im(b，M，N，x0，Yy0) bound2im 
nary jimage with 0s as the background. Parameter Conn specifies the desi 8 en 
connectivity of the output boundaries; lts values can be 4 of 8 (the defau renerates a binary Image, g, of size M X NA, with ls for boundary points and a 
Parameter dir Specifies the direction in which the boundaries are traced: 诡 acCkgTOuUnd of 0s. Fararmeters x0 and y0 determine the location of the mini- 
Values can be 'cwW' (the default) or "ccw ， 8 a clockwise Or counte Im X- and -coordinates of b in the image. Boundary b must be an PP X 2 
clockwise direction. Thus, 让 8-connectivity and a 'cw” direction are accep 2XH1HD)array of coordinates where, as mentioned earlier, npD is the num- 
able, we can use the Simpler Syntax rof pomts. 二 xX0 and y0 are omitted,the boundary is centered approximate- 
in the gxAarray 开 inaddition,MandNare omitted, the vertical and 
B = boundaries(f) horizontal dimensions of the image are equal to the height and width of 
oundary b. 下 fanction boundaries finds mujltiple boundaries, we can get all 
人 he coordinates for use in function bound2im by concatenating the varfious el- 
Output 8 in both Syntaxes 1$ 3 cell arTray whose elements ae the coordinates< ments of cell array B: 5 
the boundaries found. Ihe first and last points in the boundaries returnedb 机 二 三 
function boundaries are the same. TIhis produces a closed boundary. b = cat(1，8{ :1) cat operator See 
As an example to fix ideas, Suppose that we want to find the boundary: also Examzpie 17.73. 
the object with the longest boundary in image f (tor Simplicity we assumet here the 1 indicates concatenation along the first (vertical) dimension of the 
the longest boundary ls unlque). We do this with the folowing sequence 0 Ta 
commands: Function 
>> B = boundaries(f); Is，SuU] = bsubsamp(b，gridsep) 0 
>> d = cellfun('length'，B); 汪汪 汪汪 
See gerriop 2 103 >> [max d，kj = max(d); sampjes a (Single) boundary b onto a grid whose lines are separated by 


dsep Pixeis. TIhe output s js a boundary with fewer points than b, the num- 


WV 


Jorar explararionOF > 
1 Le OF ncHom 


= B{K(1T) 


失色 关 ， 
Vector v contains the coordinates of the longest boundary in the input imag boundary Polnts scaled so that transitions in their coordinates are unity 
and Kk js the corresponding region number; array vis of size np x2.TIhe last state 1 18 Useful for coding the boundary Using chain codes, as discussed in 
ment simply selects the first boundary of maximum length 这 there is more than yection 41.1.2. Tt is required that the points in b be ordered in a clockwise or 
one Such boundary. As noted in the previous paragraph, the first and jast point unterciockwise dir ection， 
of everTy boundary computed using function boundaries are the Same, SO IO When aboundary is subsampled USIng bsupsamp,its Polnts cease to be con- 
v(1，:) isthesame asrowv(end，?:)， cted. [hey can be Teconnected by lsing 
Function bound2eight with Syntax z = connectpoly(s(:，1)，s(;， 2)) connectpoly 
2 
bound2eight b8 = bound2eight(b) Where the rows of s are the coordinates of a subsampled boundary It is re- 


区 MegavhNwy mywsrnvhtoewrnlownn ev eeoomy 


二 .duired that the points in s be ordered, either in a clockwise or counterclock- 
se direction,. The rows of onutpnuit z are the _ coordinates of a connected 
undary formed by connecting the points in s with the shortest possible 
th consisting of 4- or 8-connected straight segments. This function is useful 
I producing a polygonal，fully connected boundary that is generally 
moother (and simpler) than the original boundary b, from which s was ob- 
ained. Function connectpoly also is quite useful when working with func- 
ions that generate only the vertices of a polygon、such as minperpoly， 
1Scussed in Section 11.2.3. 


TeIDOves from b plXxejls that are DecesSsary for 4-connectedness bUt not DeCeS- 
sary for 8-connectedness，leaving a boundary whose pixels are onjy: 
8-connected. Input b must be an np X 2 matrix, each row of which contains- 
the (xz, y) coordinates of a boundary pixel. It is required that b be a closed， 
connected set of PlxXels ordered sequentially in the clockwise OF coOUnteI 
clockwise directlon. IJhe same conditions apply to function bound2four: 


bound2four b4 = bound2four(b ) 


er ve nrovrorcmem ee 
本 5 二 
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rclockwise direction in Fig. 11.1) that separate two adjacent elements of the 
iode.For instance, the first difterence of the 4-direction chain code 10103322 1S 
33030. IE we elect to treat the code as a circular sequence, then the first elje- 
ent of the difference is computed by Using the fransition between the last 
d first components of the chain, Here, the result is 33133030. Normalization 
th rfespect to arbitrary rotational angles is achijeved by orienting the bound- 
y with respect to Some dominant feature, Such as lts major aXlS, 3S discussed 
Section ],3.2. 


Compnuting the integer coordinates of a straight line joining two points 这 
basic tool when working with boundaries (for example,functon connectpol 
requires a Subfunction that does this), IPT function intline is well suited fo 
thjs PUTPogse. Its SYntaxX 1S 


[Xx，yY] = int1line(xf，Xx2，y1T，yY2) 





下 贡 区 二 二 二 where (x1,y1y and (x2,，y2) are the integer coordinates of the two Points 地 








docttmenied 7PT be connected. The outputs x and y are column vectors containing the integemt Function fchcode, with syntax 
x- and y-coordinates of the straight line joining the two Points. 
code Binctaded 演 ， 
M4ppendir C C = fchcode(b，conn，dir) ee 要 
ER mputes the Freeman chain code of an ?P X 2 set of ordered boundary 
As noted at the beginning of this chapter, the segmentation techniques di points stored in array b. The output c is a Structure with the folowing fields， 
cussed in Chapter 10 yield raw data in the form of pixels along a boundar iere the nuUmbers inside the parentheses indicate arTay Size: 
pixels contained in a Tegion.Ajthough these data Sometimes arIe Used dire 
to_ obtain descriptors (as in determining the texture of a Tegion)，stand ee 5 0 chain code (1 X 7 
P 8 8 三 
practice is to use schemes that compact the data into representatlons that a 和 加 ee ee 
considerably more useful in the computation of descriptors. In thls section We 而 加 ee ne ee 。 人 
discuss the implementation of various repTesentation apProaches. 人 号 ) 
cC.x0y0 = Coordinates where the code starts (1 X 2) 
1 Chain Codes Tameter Conn specifies the connectivity of the codei its value can be4or 8 
Chain codes are used to represent a boundary by a connected Sequence ， ie defaultb) .人 A value of 4 is valid only when the boundary contalins no diago- 
straight-line segments of specified length and direction. Typically, this repr nal transltilons， 缚 汪 
sentation is based on 4- or 8-connectivity of the segments. The direction of 了 arameter dir specifies the direction of the output code: 下 Same 18 SPec- 
each segment is coded by using anumbering scheme Such as the ones Shown 也 汪 ， ed, the code is in the same direction as the points in b. LSing reverse 
Figs. 11.1(a) and (b), Chain codes based on this scheme are referred to 由 es the code to be in the opposite direction. TIhe default 1 Same . Thus， 
人 Titng c= fchcode(b ， conn) USe5S the defauit direction,andc=fchcode(b) 
The chain code of a boundary depends on the starting point, However, the Uses the default connectivlty and direction， 
code can be normalized with respect to the starting Point by treating lt as a CI 2 . . 
cular sequence of direction numbers and redefining the starting point so that tbe 了 gure 11.2(a) shows an image,f, of a Circular Stroke embedded in SPecujar 0 了 .3: 
resulting sequence of numbers forms an integer of minimum magnitude. We can jse. The Eee of thls example 1S to obtaln the chain code and first dlifter- | 攻 
normalize for rotation [in increments of 90* or 45$", as shown in Figs.11.1(a) and ce of the object's boundary. It 1 obylous by looking at g. 1L2(a) that the its variations 
(b)] by using the jzrst dijference of the chain code instead of the code itself Th e fragments attached to the object Would result mn a verIy irregular bound- 
difference is obtained by counting the number of direction changes (in a courn- ,not truly descrlpbve of the general shape of the object- smoothlng ls arou- 
ine process when working with noisy boundaries. Figure 11.2(b) Shows the 
esult, g, of using a9 X 9 avyeraging mask: 
3 5: “ 
FIGURE 11.1 1 = fsSpecial( average ，9) | 


让 


(a) Direction imfilter(f，h， ”replicate ) 


numbers for 

(a) a 4-directionaj 
chalin code, and 

(bj an 8-directional 
chaln code. 


《人 
(了 


ussed in the PIevious Section: 
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bre, we used a grid Separation equal to apPIOXimaately 10% the width of the 

fage, Which in this case Was of size 570 X 370 pixels. The resuiting Points can 

displayed as an image [Fig.11.2(e)]: 


>> B = boundaries{(g) 
Asin the 韦 ustration in Section 11.1.3,we are lnterestedin the longest bounda 
>> d = cellfun('tength'，8B); 


>> [max d，kj = max(d); 
> 


g2 = bound2im(S，M，N，min(s(:，1))，min(s(:，2))) 
saconnected Sequence [Fig.11.2(0] by using the commands 


= Connectpoly(s(:，1)，S(:，2)) |; 
bound2im(cn，M，N，mlintfcn(:，1))，nmint(cn(:，2))); 


G 
二 
| 


The boundary imasge in Fig. 11.2(d) was generated using the commands: 


3 
To 
中 


>> [MNj = size(g); 
>> g = bound2im(b，M，N，min(b(:，1))，min(b(:，2))》); 


Obtaining the chain code of b directly woujd result in a long Sequence Wi chain code is obtained from the scaled Sequence su: 
Small variations that are not necesSSarily Tepresentative of the general shape 
the Image, Thus,， as js typical in chain-code Processlng， we Subsamplet 
boundary using function bsubsamp discussed in the Preyvlous Section: 


>> [s，su] = bsubsamp(b，50) ; 


Yy examining c,fcc ,Fig.11.2( fand c,.x0Oy0 we see ihat the code starts on 
left of the figure and proceeds in the clockwise direction, which is the same 
Tection as the coordinates of the boundary. 浊 


2. Polygonal Approximations Using Minimum-Perimeter 
Polygons 
digital boundary can be approximated with arbitrary accuracy by 3 polygon. 


Foraclosed curve,the appiIoximation js exact when the nuUmber of segments in 
he polygon is equal to the number of Points in the boundary, So that each Pair 





3 人 
de 


FICURE 11.2 (a) Noisy lmage. (b) Image Smoothed with a 9 X 9 averaging mask. (c) Thresholded ima8 
(d) Boundary of binary Image， (e) 95ubsampled boundary， 人 Connected points from 人 e) 
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ab 
FIGURE 11.3 
(a) Object 
boundary 


enclosed by cells, 


(b) Minimum- 
Perlimeter 
polygon， 







of adjacent points defines an edge of the polygon. 了 practice, the goal ol ab 
polygonal approximation is to use the fewest vertices possible to capturet 
“essenice"”of the boundary shape, 


Aparticularly attractive approach to polygonal apProximation ls to find 了 ht 
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FIGURE 11.4 (a) Region 
enclosed by the Inner wajl 
of the cellular comPpjlex in 


71aipiptuUmz-perinteter polygon (MPP) of a region or boundary. TIhe theoretica 呈 0 (…) and 
underpinnings and an algorithm for finding MPPs are discussed ln the cla ee concave (oj corner 











































Paper by Sklansky et al. [1972] (see also Kim and skjansky [1982]). In this 
ftion we present the fundamentals of the algorithm and give an Mi-function 
Plementation of the Procedure. LIhe method is restricted to SPie polygon 
(i.e.,polygons with no sejlf-intersections).Also, regions With Peninsujlar protru 
sions that are one Pixel thick are excluded. Such protrusions can be exXtracte' 
using morphological methods and then ITeappended after the polygonal ap 
PIoximation has been computed. 





;The following properties are basic ip formulating an approach for finding 
PPs: 


ary by a set of concatenated cells, as shown in Fig. 11.3(a). It helps to visuali The MPP corresponding to a Simply connected cellular compiex 1 not 
self-intersecting. Let 忆 denote thls MPEE 


Every comyex vertex of 已 coincides with a。(but not eyeIy， 18 3a VEIteX 
of 己 ). 

Every comcave vertex of P coincides with a。 (but not eveIy 。 is a VerTtex 
of 局)， 

,Ifa .in factis part of P,but itis not a conveX VerteX of 已 ,then it hies oO the 
ss edge of 已. 


Foundation 


of the strip of cells and thlink of the object boundary as a rubber band conz 右 
tained within the two walls. If the rubber band is allowed to shrink,it takes thes 





fits the geometry established by the cell strip， 

Sklansky's approach uses 3a So-called celiular coaplexr OT CeluiQr 7103Gi 
which ,for our purposes,is the set of sguare celis used to enclose a boundary, asi 
Fig. 11.3(a). Figure 11.4(a) shows the region (Shaded) enclosed by the celluja 
complex. Note that the boundary of this region forms a 4-connected path.As 允 
traverse thbis path in a clockwise direction, we assign ablack dot (*) to the COnYVe 
corners (those with interior angles equal to 90?) and a white dot (。) to the con- 党 
cave corners (those with interior angles equal to 270?).As Fig. 11.4(b) Shows， th 
black dots are placed on the convex corners themselves. IJhe white dots 3 
placed diagonally opposite their corresponding concave corners. This corr 


sponds to the cellular complex and vertex definitions of the algorlthrm. 


Jn our discussion, a vertex of a polygon 1 defined to be comyex 这 ts inierOr 
anple is in the range 0? < 0 < 180?; otherwise the vertex 1S CO7CIYe Asin the 
pievious paragTaph, convexity is measured with respect to the interlor Teglon 
We travel in a clockwise direction， 


Algorithm for Finding MIETSs 


Properties 1 through 4 are the basis for finding the vertices of an MPP Ihere are 
rious ways to do this (e.g., See Sklansky et al [1972j, and Kim and Sklansky 
昌 982]).The approach we follow here is deslgned to take advantage of two basic 
IPT/MATLAB functions. The first is qtdecomp, which pertorms quadtree de- 
mpositions that ljead to the cellular wajll enclosing the data of interest. The sec- 
ond is function inpolygon, used to determine which Points lie outslide, On，OT 
ide the boundary of a polygon defined by a given set of Vertices. 

I will be helpful to develop the procedure for finding MPEs in the context 
of an illustration. We use Figs. 11.3 and 11.4 again for this Purpose. An ap- 
proach for finding the 4-connected boundary of the shaded inner repgion In 
五 g 11.4(ay is discussed later in this sectlon， After the boundary has been ob- 
tained, the next step is to find its corners, whlch we do by obtaining its Free- 
man chain code. Changes in code direction indicate a corner in the boundary. 
By analyzing direction changes as we travel in a clockwise direction through 
the boundary it becomes a fairly easy task to determine and mark the coOnVex 
and concave corners, as in Fig, 11.4(b).The Specific apProach for obtalnlng the 





markers for the boundary 
of the reglon in (a). Note 
that concave markers are 
placed diagonally opposite 
their corresponding 
COIners. 


7TpecondironD= 做 
iPOLGCLowed cad 

日 = 780925 rreGiedt 
49 QSDeCiGL CQ32. 
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markers 1s documented in M-function minperpoly discussed later in this S6 
tion. The corners determined in this manpner are as in Fig. 11.4(b)， Which 
show again in Fig. 11.5(a).The shaded reglion and background grid are incji 
ed for easy Ieference. IJhe boundary of the Shaded region js not shown to avo 谤 
confusion with the polygonal boundaries shown throughout Flg. 11.5. 
Next, we form an Initial polygon using only the initial convex vertices (the bla 
dotsj, as Fig. 11.5(b) shows. We know from Property 2 that the set of MPP cony 
vertices 1 a Subset of thls initial set of convex vertices. We See that all the conca 
Vertices (white dots) jying ouiside the initial polygon do not form concavitie 
the polygon. For those particular vertices to become convex at a jater Stage 训 考 
algorithm, the Polygon would have to pass through them. But, we know that tk 
can never become convex becatuse all possible convex vertices are accounted 
at this point (it is possibje that their angle could become 180? later, but that wot 
have no effect on the shape of the polygon).Thus,the white dots outside the i initi 
polygon can be eliminated from further analysis, as Fig. 11.S(c) shows. 
The concavye vertices (White dots) inside the polygon are assSociated Wi 
concavities in the boundary that were ignored in the first Pass.Jhus, these ver 
tices miust be incorporated into the polygon, as shown in Fig. 11.5(d). At 刀 
polint generajlly there are Yertices that are black dots but that have ceasedtoH 
conVeX in the new Polygon [See the black dots marked with arrows i 
Fig. 11.3(d)]. There are two possible reasons for this. The first Teason mayh 
that these vertices are part of the Starting polygon ip Fig. 11.S(b)，which ai 
cludes al convex (black) vertices. The Second reason could be that they hay 
become convex as a Tesult of our having incorporated additional (white) ve 
tices into the polygon as in Fig.11.5(d).Therefore, all black dots in the polygo 
must be tested to See if any of the vertex angles at those points nOw exXcee 
180".All those that do are deleted. The procedure in then repeated. 
Figure 11.3(e) Shows only one new black vertex that has become concay 
during the Second pass through the data. The procedure teIminates whena 
further vertex changes take place, at which time all vertices with angles of 180 
are deleted because they are on an edge, and thus do not affect the shape 0 
the final polygon. The boundary in Fig. 11.5(f) is the MPP for our exampl 
Thlis polygon is the same as fthe polygon in 囊 g. 11.3(b). Finally, Fig. 11.4( 引 
shows the original cejlular complex Superimposed on the MPP 
The preceding discussion is Sumimarized in the following steps tor finding 
the MPP of a region: 


1 
人 
了 了 。 
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才 原 而 而 EN 





Obtain the cellujar complex (the approach is discussed later in this Section 六 
Ohbtain the region intermal to the cellular complex. 
Use function boundaries to obtain the boundary of the region in step 2 35S- 
a 4-connected, clockwise Sequence of coordiraaley， 
Obtain the Freeman chain code of this 4-connected sequence Using func- 
tion fchcode. 
Obtain the convex (black dots) and concave (white dots) vertices from 
chain code. 
Form an inltial polygon using the black dots as vertices, and dejete from 
further analysis any white dots that are outside this poljygon (white dof 
on the poiygon boundary are kept). - 


IGURE 11.5 (a) Convyex (black) and concave (white) vertices of the boundary In Fig. 4(a) (b) initial 
polygon joining all convex vertices. (C) Resujt aftter deleting coOncave vertices Outside of the polygon. 
td) Result of incorporating the remaining concave vertices into the polygon (the arrows indicate biack 
ertices that have become concave and will be deleted). (e) Restuit of deleting concave black vertices (the 
:8TOW indicates a black vertex that nOw has pecome concave). (位 Final result showins the MPP (g) MPP 
with boundary cells superimposed， 
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了 和 AMPLE 1.4: 
Obtaining the 
celluliar wall of 
the boundary of a 
Tegion， 
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7. Form a polygon with the remaining black and white dots aS Vertlces， 
8. Delete ali black dots that are concave vertices. 
9。 Repeat steps7 and 8 until all changes cease, at which time all Vertices wii 

angles of 180" are deleted.The remaining dots are the vertices of the MPI 
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Some of the M-Functions Used in Implementing the MPP Algorith 


We use function qtdecomp introduced in Section 10.4.2 as the first step in 0 
taining the cellular complex enciosing a boundary. As Usual, we consider 亿 
region, B, in question to be composed of 1s and the background of 0s. TI 
qtdecomp syntax applicable to our work here is : 


Q = qtdecomp{(B，threshold，[mindim maxdinj] ) 





where Q is a sparse matrix containing the quadtree Structure. 二 Q(k，m) si 
nonzero, then (k, m) is the upper-left corner of a block in the decompositions 
andthe Size of the block is Q(Ky my). 人 


天 半 F 
| 


A block is split iftthe maximum value of the block elements malnus the minl 
mum value of the block elements is greater than thresho1ld. Ihe vaiue of tj 





性 压根 于 一 硬 丁 本 





parameter is specified between 0 and 1, Tegardless of the class of the inpt 下 
image. Using the preceding syntax, function qtdecomp will not produce block 一 图 
smaller than mindim or larger than maxdim.Blocks larger than maxdinm are SP 一 坷 -二 


even i they do not meet the thresholid condition. The Tatio maxdimymindi 
must be a Power of 2. 

If only one of the two values is specified (without the bracketsj, the fun 
tion assumes that it is mindim. This is the formulation we use in this sectiol 
Image B must be of size K x K, such that the Tatio of K/mindinm is an integer: 
power of 2. Clearly, the smallest possible value of K is the jargest dimension of 
B,.The size requirements generally are met bypadding 8 with zeros with optiom 
'post' in function padarray.For example,suppose thatBis ofsize 640 X 480 
pixels, and we specify mindim = 3. Parameter K has to Satisfy the conditions: 
K>= max(size(B)) and K/mindim=2^p,orK=mindimx*(2^p).Solving forp 
gives p = 8,in which caseK= 768. 

To get the block values in a quadtree decomposition We USe function 
qtgetblk, discussed in Section 10.4.2: 
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0.0.0> 








[vals，r，c] = qtgetblk(B，Q，mindzmi 









寺 寺 村 出 站 才 
where vals is an array containing the vajues of the mindimxmindin blocks In 画 匿 贺 杆 而 天 而 恨 国 醒 限 古 厨 夯 国 
the quadtree _ decomposition of B, and Q is the sparse matrlx returned by 

qtdecomp. Parameters r and c are vectors containing the IOw and column co 


ordinates of the Upper-left corners of the biocks. 
onal padding is required for the specified value of mindim.The 4-connected 


boundary of the region is obtained uslng the following command (the margin 


冉 To see how steps 1 through 4 of the MPP algorithm aIe implemented， con- 
ote in the next page explains why 8 is used in the function cally): 


sider the jimage in Fig. 11.6(a), and Suppose that we Specify mindim = “. We 
show individual pixels as small squares to facilitate explanation cf function 
qtdecomp. The image is of size 32 X 32, and it is easily verified that DO addi- 





>> B = bwperim(B，8) 
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FIGURE 11.6 

(a) Oriiginal 
image, where the 
Small squares 
denote individual 
Pixels. (b) 4- 
connected 
boundary. 

(c) Quadtiree 
decomiposlitlon 
using blocks of 
minirnum size 2 
pixels on the slde. 
(d) Result of 
filling with 1s alj 
blocKs of Slze 
2X2that 
contalned at least 
one element 
valued 1. This 1S 
the cejlular 
COoImP1ex. 

(e) Inner reglon 
of (d). 

(但 4-connected 
boundary Polints 
obtained Using 
function 
boundaries.Ihe 
chain code was 
obtained using 
function fchcode. 
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bwperinm 
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Figure 11.6(b) shows the result. Note that B is still an image, whlich now con M-Function for Computing MPPs 


tains only a 4-connected boundary (keep in mind that the Small squares are 这 
dividual Pixels). 

Figure 11.6(c) shows the quadtree decomposition of B, obtalned using 
comimand 


0 1 through 9 of the MPP algorithm are implemented in function 
perpoly, whose hsting ls Included in Appendix C.The syntax is 


[Ix，yj = minperpoly(B，celLlsize) 


>> Q = qtdecomp (8B，0，2) 1; 
1 人 ere B ls an input blinary image containing a single region or boundary, and 
1size ls the size of the square cells in the cellular complex used to enclose 
boundary. Column vectors x and y contain the x- and y-coordinates of the 
PP vertices，. 


where 0 was used for the threshold so that blocks were split down to the min 
mum 2 X 2 size, regardless of the mixture of ls and 0s they contained (eac 
such block is capable of containing between zero antid four Pixels) Note 也 3 
there are numerous blocks of size greater than 2 X 2，but they are- 芷 
homogeneous. 
Next we used qtgetblk(B，Q，2) to extract the values and top-jeft corne 
coordinates of all the blocks of size 2 xx 2. Then all the blocks that contained 
at least one pixel valued 1 were 名 led with 1s. This result, which we denote 
BF, is shown in Fig. 11.6(d).The dark cells in this image constitute the celluli 
complex. In other words, these cells enclose the boundary in Fig.11.6(bD)， 
The region bounded by the cellular complex 训 Fig. 11.6(d) was obta 
using thbe command 


igure 11.7(aj) Shows an image, 8B, of a maple leaf and Fig. 11.7(b) is the 
ndary obtained using the commands 


boundaries(B，4， 
b{1)}; 

M，N] = ) 

min = man(b(:，1) 
main = min(bl(:，2) 
im = bound2imf(b， 

mshowtbinm') 


wj 


SiLze(B) 


W 


) 
) 
中 ，N，Xxmzan，yYmin) ; 


>> R = imfilL(BF， "holes ) & ~BF 
Figure 11.6(e) shows the result. We are interested in the 4-connected boundarK 
of this region, whbich we obtain using the commands lsexample. Figure 11.7(c) is the result of using the commands 
>> b = boundaries(b，4， CCcw ) 
>> b = bf11; 


y] = minperpoly(B，2) 
connectpoly(x，Y) 
bound2imftb2，M，N，xmin，ymnin) 

mshow(B2 ) 


区 

2 = 
咏 二 
Figure 11.6(f shows the result. The Freeman chain code shown in this fgt 


was obtained using function fchcode.This compjletes steps 1 through 4 of 
MPP algorithm. 


Function inpolygon is used in function minperpoly (discussed in the De 允 
section) to determine whether a point is outside, on the boundary, or inside 8 
polygoni; the Syntax 1S 
















lobes. These are preserved reasonably well even for cells of size 8,， as 
1.7(f) shows. Further increases in the size of the cells to 10 and even to 16 
tl preserve this feature, as Pigs, 11.8(a) and (b) show. However as Shown in 
IN = inpolLygon(X，Y，XV，YV) 
e arrows in Figs. 11.7(c) and (e) Point to nodes formed by self-intersecting 
DeS These nodes can arise 让 the size of the indentation in the boundary with re- 
ct to the cejll size is such that when the concave vertices are created, their PoO- 
onS “crossS”each other, altering the clockwise sequence of the vertices. One 
Proach for solving this problem is to delete one of the vertices.The other is 
crease OF decrease the cell Size. For example, Fig. 11.7(d), which corre- 
onds to a cell size of 3, does not have the problem exhibited by the vertice 


merated with celjs of sizes 2 and 4. 要 


the polygon vertices, arranged in a clockwise or counterciockwise Sequence 
Array IN js a vector whose length is equal to the number of points beingt 
ed. Its values are 1l for points inside or on the boundary of the polygom， and 
for points outside the boundary. 
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mznpbperpoly 
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了 上 XAMPLE 拭 .5: 
Using function 
mmiInperpoly. 
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FIGURE 11.7 

(a) Origlinal 
Image. 

(b) 4-connected 
boundary. 

(c) MPP obtained 
UsIing SqUaITe 
bounding cells of 
size 2. (d) through 
(人 MPPs obtained 
using Square cejls 
of sizes 3, 4, and 8%， 
resSPectively. 


91gnatuIes 


" 
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关 泛 
FiCURE 11.8 
MPPs obtalined 
with even largeT 
bounding Square 
cells of SiZes 

(a) 10, (b) 16, (c) 
20, and (d) 32. 





ature is a 1-D functional representation of a boundary and may be gen- 
ed in various ways. One of the simplest is to plot the distance from an inte- 
oint (e.g., the centroid) to the boundary as a function of angle，a8 
ated in Fig. 11.9. Regardless of how a Signature 15 generated, however， 
sic jidea is to reduce the boundary representation to a 1-D function， 
h presumably is easier to describe than the original 2-D boundary. 上 eePp 
mind that it makes sense to consider using Signatures only when it can be 
aranteed that the vector extending from its origin to the boundary InterSects 
boundary only once, thus yielding 3 single-valued function of increasing 
le.This excludes boundaries with self-intersections, and it also tyPicaly ex- 
des boundaries with deep,narrow concavilties OT thin, long Protruslons. 

Signatures generated by the approach just described are invariant to transja- 
n,but they do depend on rotation and Scaling. Normalization with respect to 


450 ”Chapier ]1 党 Representation and Description 和 未 二 ES 又 


FIGURE 11.10 
7 YY ee 
FIGURE 11.9 MATLAB for 

人 a) and (b) performing 
Clrcular and COTnverslons 
Square Objects. -4 一 0 between pofar 
(c) and (d) r(b) and Carteslan 


coordinates, and 
Vice veTSa. 





下 四 
to 上 
二 | 守 
































Corresponding _ 
distance VeTSUS Y24 
angje Signatufes，. 4 到 一 一 
名 mature as a function of increasing angle is output In St. Coordinates (x0， 
inthe input are the coordinates of the origin of the vector extending to the 
ndary. If these coordinates are not included in the argument, the function 
is to seject the starting Point as the Polnt farthest 区 om the OTlgIn of the Vecto Uses the coordinates of the centrojd of the boundary by default. In either case， 
(See Section 11.3.1), 寺 thls potmt happens to be unlique and independent of fie values of (x0，y0) used by the function are included in the au he 
tational aberrations for each shape of Interest. e of arrays st and angle is 360 x 1l, indicating a resolution of one degree. 
Another way ls to select a point on the major elgen axXls (See Section 二 ie input must be a one-pixel-thick boundary obtained , for example, using 
ipbls method requires more computation but 1 more ITUgged because the ctlon boundaries (see Section 11.1.3).As before, we assume that a bound- 
rection of the elgen axes 1l$s determined by using all contour points. Yet ang yis a closed curve, 
er Way ls to obtain the chain code of the boundary and then use the appro Function signature nutilizes MATLAB's function cart2pol to conyvert 
discussed in Section 11.1.2,assuming that the rotation can be appTOXimated artesian to Polar coordinates.The syntax is 
tbe discrete angles in the code directions defined In Fig. 工 .1 
Based on the assumptions of unliformity In Scaling with respect to both zx 
and that sampjling is taken at equal intervals of 0, changes in Size of a shapeTew 
sujlt in changes in the ampjitude values of the corresponding Signature. De 
to normalize for tbls dependence js to Scale all functions So that they alwa 
Span the Same range of values, say, [0,. 1]. The main advantage of this meth 
Simplicity but it has the potentially serlous disadvantage that scaling of the el 
tire function ls based on onjly two values: the minimum and maximum， 开 包 
Shapes are noOlsy, thls can be a SoUrce of erTor from Object to object, A mef 
rugged approach ls to divide each sample by the variance of the signature， 
suming that the yariance ls hot zero 一 as in the case of Fig. 11.9(a) 一 oOT So Sm 
that it creates computational difficujties, Use of the variance yields a Varl8 
scajling factor that Is inversely proportional to changes in Size and WOIKS TU 
as automatic gain coOntrol does. Whatever the method used, keep in_ mind 也 





re 
人 
1 


[THETA，RHO] = cart2po1(X，Y) 





att 2po1l 
Where X and Y are Vectors containing the coordinates of the Cartesian points.The 
ors THETAand RHO contain the corresponding angle and length ofthe polar co- 
urdinates. 开 X and Y are IOow vectors, so are THETA and RHO0, and similarly in the 
ase of columns. Figure 11.10 shows the convention used by MATLARB for coordi- 
te conyersions. Note that the MATLAB coordinates (X,，Y) inthis Situation are 
ated to our image coordinates (X,， y) as X = yandY = -xfsee Fig.2.1(a)]. 
unctlon pO12cart js used for converting back to Cartesian coordinates: 


[X，Y] = pol2cart(THETA，RHO) 





5 
sr 了 OE2Cart 


iigures 11.11(a) and (b) show the boundaries, bs and bt, of an irregular EXAMPLE 11.6: 
Juare and triangle, respectively embedded in arITays of Size 674 X 674 pixels， Signatures. 
Te 1T.11(c) shows the signature of the Square, obtained using the commands 


tal shape of the wavefornmas. 
Function signature,included in Appendix C,finds the Signature of a 外 

人 [st，angle，x0，y0] = signature (bs ) ; 

signature [st，angle，x0，y0] = signature(b，x0，Yy0) plot(ang1e，st) 

me Values of x0 and y0 obtained in the preceding command were [342, 326]. 


where b is an AP X 2 array containing the xy-coordinates of a boundary 二 
Simliar pair of commands yielded the plot in Fig. 11.114d), whose centroid is 


dered in a clockwise or counterclockwise direction. The amplitude of 也 
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FICURE 13.12 
(a) A reglon 9 
and its CORVexX 


ab 

Cd 

FIGURE 11.11 
(a) and (b) 


Boundaries of an deficiency 
irregujlar SqUaTIe (Shaded). 

and triangle. (b) Partitioned 
ws 
Corresponding 

SignatuTes. 

























TLAB tools necesSary to implement boundary decomposltion in the man- 
ist descrIibed are contained in function regionprops, whlich is discussed 





400 


人 站 上 ireduce it to a graph, JIhis reduction may be accomplished by obtaimning the 


[he skejleton of a region may be defined via the medial axis transformation 


220 AT).The MAT of aregion 尺 with border pb is as follows. For each Point P in 
e find its closest neighbor in D.IfPp has more than one Such neighbor, it 1 

obelong to the 1aedial axig (SKeleton) of 尺 . 
150 人 lthough the MAT of a region is an intuitive concept, direct Implementa- 


100 200 300 400 :人 .of this definition is expensive computationally as it involves calculating 


pon, Numerous algorithms have been proposed for improving comPputational 
ency while at the same time attempting to appProximate the medial axXls 
esentation of a Teglon， 

s noted in Section 9.3.4, IPT generates the skeleton of all reglons con- 


located at [416, 335]. Note that simply counting the number of promineni 
peaks in the two signatures is suffticlent to differentiate between the 


boundarles， 


11.2.4 Boundary Segments 


Decomposing a boundary into Segments often is useful, Decomposition 臣 S = bwmorph(B， 'Skel'，Inf) 


duces the boundary's complexity and thus simplifies the description PIOC 人 
This approach is particularly attractive when the boundary contains One 章 
more significant concavities that carry shape information, In thls case US& 
the convex hull of the region enclosed by the boundary 1S a powerful tool 
robust decomposition of the boundary. 

The conyex jz1 of an arbitrary set Sis the smallest convex set contaln 
S The set difference 囊 -Siscailed the conyvex deficiencyD,of the set 人 
see how these concepts might be used to partition a boundary into meanln 
segments, consider Fig. 11.12(aj, which shows an object (set 9) and its COD 
deficiency (shaded regions), The region boundary can be partitioned by 
lowing the contour of $ and marking the points at Which a transition 1 本 ” 
into or out of a component of the convex deficiency.EliguTe 11.12{(b) ShowS 
result in this case. In Principle, this scheme is independent of region SiZe 3 


is function removes pixels on the boundaries of objects but does not allow 
jects to break apart.The pixels remaining make up the image skeleton. TIhls 
on preserves the Euler number (defined in Table 11.1)， 


igure 11.13(a) shows an image, f, repfesentative of what a human chro- 上 XAMPLE 1.7: 
mosome looks like after it has been segmented out of an electron microscope “Computng the 
iage with magnification on the order of 30,000X.Thbe objective of this exam- ee 史 

l to compaute the skeleton of the chromosome， 8 
early,the first step in the Process must be to isolate the chromosome from 

background of irrelevant detail. One approach is to Smooth the image and 

en threshold it, Figure 11.13(b) shows the result of smoothing f with 3 

X 23 Gaussian Spatial masKkK with sig = 15: 
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oa 


that increasing the threshold value increases the amount of data Te- 
bwmorph(tg， "skel' ，Inf); 点 Fig， 11.13(d) 


ft.= bwmorph(s， 'Spur' ，8); %Fig. 11.13(e) 


mately one-hajlf the value of sig in the smoothing filter. Several small 
ti remain in the skeleton. However, applyijng the previous function an 
onal 7 times (to_ complete the value of sig) yieided the result n 
,13( 人 ,which is a reasonable Skeleton representation of the Input. As 3 
thumb,the value of sig of a Gaussian smoothing mask is a good 8gulde- 
for the sejlection of the number of times a Spur Iemoval algorltbm 
党 


S Section we discuss a number of descriptors that are Useful when WorIkK- 
ith region boundaries. As will become evident shortly many of these de- 
btors can be used for boundaries and/or regions,and the grouping of these 
Giiptors in IPT does not make a distinction regarding their applicability， 


3 

de 在 

FIGURE 11.13 (a) Segmented 75 X 25 Gatg 
averaging mask with sig = 1 human_ chromosome. (b) Image smoothed using 3 


spur removal, ( 蕊 Result of 7 a 5.(c) Thresholded image. (d) Skeleton, (e) Skeleton after 8 applicat 
in dditional appjications of SpuT reInoval 


人 > > 一 。 

人 _ >m2double(f); 
”fspeciall'gaussian'，25，15) | 
umfilter(f，h，"repLicate ) 
Oow(g) %5 Fig.， 11.13(b) 


>> 昌 
>> Imsh 













有 n Whether 4- of 8-connectivity (the default) is desired (see the margin 
In xample 1i1.4 concerning the interpretation of these COnnectivlity val- 
水 The Objects in f can have any pixel values consistent with the image class, 





Ne threshold the smoothed image: 


g = : 

figurnm2bw(g， 1.5*graythresh ; tall background pixels have to be 0. By definition, the perimeter pixels are 
e，jimshow(g) % Fig，11.13(9) and are connected to at least one other nonZero Pixel. 

0 automatically determined threshoid， graythresh(g)， was 王 ， “Onnectivity can be defined in a more general way in IPT byusinga3xX: 


pled by 1 S to increase by 50% the amount of thresholding.The reasoning 基 TIX of 0s and 1ls for conn, The 1l-valued elements define nelghborhood 
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dametenr 














ljocations Ielative to the center element of conn. For example, conn = ones 


defines 8-connectivity Array conn must be symmetric about its cente 
ment, The Input Image can be of any class. The output image containing he 
boundary of each object in the input is of class logical. 
The daeter of a boundary is defined as the Euclidean distance betg 
the tfwo farthest points on the boundary. These points are not always uniq 4 
in a circle or Square, but generally the assumption is that 过 the diamete < 


be a useful descriptor, it is best applied to boundaries with a single pair of 信 
thest points. The line segment connecting these points is called the 117010 
of the boundary. TIhe mizpor axis of a boundary is defined as the line perp 
dicular to the major axis and of such jength that a box passing throug 
outer four points of intersection of the boundary with the two axes com 
ly encloses the boundary. Ihis box is called the pasic rectamgle, and the rati 
the major to the minor axis is called the eccemtricity of the boundary 

Functon diameter (see Appendix C for a listing) computes the diame 
major axis, minor axis, and basic rectangle of a boundary or Tegion. Its Syn 


Ss = diameter(L) 
Chain code: 000030032232221211 
Difference: 300031033013003130 


where Lis a label matrix (Section 9.4) and s is a structure with the follo 0 Shapeno: 000310330130031303 


fields: 


s.Diameter A scalar, the maximum distance between any two Pi 
in the coTTesponding region， 本 
s ,MajorAxis A2Xx2matrix.The IOwWS contain the row and colu 
coordinates for the endpoints of the major axis of 
COITesponding Tegion. 
S,MinorAxiS A2<Xx2matrx.TIhe rows contain the row and co 
coordinates for the endpoints of the minor axis o 
corresponding Tegion. : 
SBasicRectangle A4X2IaIX.Each row contains the row and colu 
coordinates of a corner of the basic rectangle. 


1.3.2 Shape Numbers 


The spape Uper of a boundary, generally based on 4-directional Freer 
chaln codes, is defined as the first difference of smallest magnitude (Briblesk 
and Guzman [1980j], Bribiesca [1981]).The order of a shape number is define 
at the Dumber of digits in its Tepresentation. Thus, the shape numpber of 和 
boundary is given by Parameter c,diffmm in function fchcode discussed 
Section j11.2.1，and the order of the shape number is compute 
JIength(c.diffmm)， 

Asnoted in Section 11.2.1, 4-directional Freeman chain codes can be 匀 
insensitive to the starting point by using the integer of minimum magni 
and made insensitive to rotations that are multipies of 90。 by using the 


Ience of the code. Thus, shape numbers are insensitive to the starting 
tand to rotations that are multiples of 90". An approach used frequently 
rmalize for arbitrary rotations is to align one of the coordinate axes with 
major axis and then extract the 4-code based on the rotated figure. The 
rocedure is llustrated in 珀 g. 11.14. 

The tools required to implement an M-function that calculates shape num- 
Shave been developed aljready. They consist of function boundaries to exX- 
t the boundary，function diameter to find the malor axis，function 
samp to reduce the resolution of the sampling grid, and function fchcode 
Xtract the shape number. Keep in mind when using function boundaries 
xXtract 4-connected boundaries that the inpnut image Imust be ljabeled using 
abel with 4-connectivity specified,. As indicated in g. 11.14, compensa- 
or rotation is based on aligning one of the coordinate axes with the major 
,The x-axis can be aligned with the major axis of a region or boundary by 
ing function x2majoraxis. The syntax of this function follows; the code is 
cluded in Appendix C: 


























[B，thetal = X2majoraxis(A，8B) 


Te@A=S.MajorAxis from function diameter,andBis an input (binary) 
age or boundary list, (As before, we assume that a boundary is a connected， 
0Sed curve.) On the output, B has the same form as the input (ie.,a binary 
Wasge Or a coordinate sequence.Because of possible round-offt error, rotations 
resujlt in a disconnected boundary sequence, SO postprocessing to relink 


"When more than one pair of farthest points exist, they should be near each other and be dominan [ 
“Points (using, for example, bwmorph) may be required. 


tors In determining boundary shape， 
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939 

FIGURE 11.14 
9teps in the 
generation of a 


shape number， 


X2majoraxiSs 
和 
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了 上 入 AMPLE 11.8: 
Fonuirier 
descriptors. 


不 
FIGURE 11.16 


(a) Binary imagpe. 


(b) Boundary 
extracted using 
function 
boundaries,The 
boundary has 


1090 points. 













oth in orderto illustrate the effect that reducing the number of descriptors 
as:On the shape of a boundary.The image in Fig. ].16(b) was generated using 
Percommands 


integer no greater than length(Z) ，If ND is omitted，it defau 
to Jength(Z)，The output，S，is an ND-by-2 matrix containing 
coordinates of a Closed boundary， 


Preliminaries， 

np = length(zy); 

各 Check :inputs . 

if nargin == 1 | nd> np 
nd = np; 

end 

5 Create an alternating Sequence of 1s and -1s for use in centering 

沁 the 直 ransform， 

X 

有 


oo oo 


= boundaries(f) ; 


= bf1l; 
bim = bound2im(b，344，270) ; 


re the dimensions shown are the dimensions of 寺 . Figure 11.16(b) shows 
mage bim、 The boundary Shown has 1090 points. Next， we compnuted the 
Eourier descriptors， 


0o:(np 一 1); 

(( 一 人) X) 

Use only nd descriptors in the_ inverse. Since the 
descriptors are centered，(np -- nd)/2 terms from each end of 


和 
下 
光 the Sequence are set to 0， 

d = round((np - nd)/2); 5 Bound in case nd is odd 
> 

Z 

移 


= frdescp(b) ; 


obtained the inverse using approxlimately 3096 of the possible 1090 


(1:d) = 0， 1ptors: 
(np 一 dr+1:np) =0; 


Compute the inverse and convert back to _ coordinates 2z546 = ifrdescp(z，546) ; 


ottom-ftacing cusp in the original boundary, were lost, but, for all practical 
purposes， the two boundaries are identical. Figures 11.17(b) through (f) 
how the results obtained using 110, $6,28, 14, and 8 descriptors, which are 
IOXImately 10%,S% ,2.5% ,1.259% and 0.7%, of the possible 1090 de- 
tors， TIhe result obtained using 110 descriptors [Fig. 11.17(c)] shows 
light further smoothing of the boundary, but， again, the general shape is 
uite close to the original. Figure 11.17(e) shows that even the result with 
人 descriptors, a mere 1.25% of the total, retained the Principal features of 
boundary. Figure 11.17(f) shows distortion that if Unacceptable be- 
Se the main feature of the boundary (the four long Protruslons) was 
st. 上 urther reduction to 4 and 2 descriptors would result in an ellijpse and， 
nally, a circle. 

Some of the boundaries in Pag.11.17 have one-pixel gaps due to round off in 
Kel values. These small gapS common with Fourier descriptors, can be re- 
aired with function bwmorph Using the'bridge' option. 避 


(1) = mx*s(I，1)) 


zz = ifft(Zz); z546im = bound2im(z546，344，270); 
s(:，1) = real(zz); 

(5 2) = imag(zZ); mage z546inm {fFig. 11.17(a)] shows close coOITespondence with the orig- 
2 aternating 1 and -1S to undo the earlier boundary in Fig. 11.16(b). Some subtle details, like a 1-pixel bay in the 


贸 Figure 11.16(a) shows a binary image, f, similar to the one in Fig.11.13( 
but obtained using a Gaussian mask of size 1 x 15 with Sigma = 9，,: 
thresholded at 0.7. The purpose was to generate an image that was not overI 

















As mentioned earlier， descriptors should be as insensitive as possible to 
anslation, rotation, and scale changes. In cases where results depend on the 
rder in which points are processed an additional constraint is that descriptors 
hould be insensitive to startlng point. Fourier descriptors are not directly in- 
nsitive to these geometric changes, but the changes in these Parameters can 
Me Telated to Simple transformations on the descriptors (see Gonzaiez and 
Woods [2002])， 
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素 一 1 
二 六 FSg(D) 
一 


this notation, K is the number of points on the boundary, and HLn(r) is di- 
ectly related to the shape of g( 门 . For exarmple, the second moment Hz 人 (7 
ieasures the Spread of the curve about the mean value of rand the third mo- 
ent A3(r) measures its Symmetry with reference to the mean.statistical mo- 
nts are Computed with function statmoments, discussed In Section 5.2.4. 

What we have accompllshed ls to reduce the description task to T-D func- 
ons. Although moments are a PopPular apProach, they are not the onjy de- 
Triptors that could be used for this Purpose. For instance, another method 
yolves compnuting the 1-D discrete Fourier transform,obtaining lts spectrum， 
using the first dg components of the spectrum to describe 8( 站 .The advan- 
e of moments over other techniques is that impPlementation of moments 1S 
raightftorward, and moments also carry a “physical” interpretation of bound- 




























年: 是 


FIGURE 11.17 (a)-( 担 Boundary reconstructed using 546, 110, 56, 28, 14, and 8 Fourier descriptors Out of 
possible 1020 descrlPtorS， 


1134 Statistical Moments 村 this section we discuss a numiber of IPT functions for Teglon processing and 
Th b d 人 sntroduce Several additional functions for computing texture, moment Invaril- 
Re nts，and Several other regional descriptors. 民 eep in mind that function 


ture waveforms) can be described quantitatively by using statistical momentsy 
such as the mean, variance, and higherorder moments. Conslder Fig. 11.18(a)， 
which shows a boundary segment, and Fig. 11.18(b), which shows the segment 
represented as a 1-D function, g(r), of an arbitrary variable .This functlon W 
obtained by connecting the two end points of the segment to torm a“major axiS 
and then using function Xx2majoraxis discussed in Section 11.3.2 to algn tbe: 
malor axis with the x-axis. 
One approach for describing the shape of g() is tonormallize itto unlt area 3nd 
treat it as a histogram. In other words, g(P) is treated as the probability of value 
oOccurring. In this case,ris considered a random valiable and the moments 3T6 


g we outline in this section. Function Poipoly (Section 5.2.4) also ls used 
equently in thls context. 


4. Function regionprops 


Function regionprops is IPT's principal tool for compnuting region descrip- 
ISThils function has the Syntax 


D = regionprops(L，properties) 
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3b 

FIGCURE 11.18 

(al) Boundary 
Segment. 

(b) Representation 
as al1-D function， 


2 Pegionprops 
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了 上 久 AMPLE 11.9: 


Using function 
regionprops， 





































BLE 11.1 


alid Strings for 


whbere Lis a label matrix and D is a structure arTay of jength max(L(: ) ). roperties 


fields of the strtucture denote different measurements for each reglon, as SB 
ified by properties.Argument properties can be acomma-Separated ]i 
strings, a cell array containing strings, the Single string all ,or the stri 
'basic'.Table 11.1 jists the set of valid Property Strings. If properties is 
string 'all'，then all the descriptors im Iable 11.1 are computed. 
properties is not specified or 诺 itis the string 'basic',then the descriptor 
compnuted are 'Area'，'Centroid' ,and 'BoundingBox '. 玫 eep in mind (3 
discussed in Section 2.1.1) that IPT uses x and y to indicate horliZontal and ver 
tical coordinates,respectively with the origin being located in the top, left.( 
ordinates X and y increase to the Tight and downward from the origi 
Tespectively. For the purposes of our discussion, on pixels are valued 1 whi 
off pixels are valued 0. 





Area， 
IBoungdIngBOXx， 


ICentroid 


IConveXArea， 
IConvVeXHUI1]: 


于 As a simple ilustration, suppose that we want to obtain the area and 也 ConvexTmage 


bounding box for each region in an Image B. We WTIlte 


% Convert B to a label matrix ， 
-boundingbox ) ; 


>> 日 = bwlabelf(B) ; 


， ， 'ECCemntrIzcity 
>> D regionprops(B， area 


上 


To extract the areas and number of reglons we WTIite 


>>W= [D.Areaji 

>> NR = LIengthtw) 
'EULerNumber 
where the elements of vector w are the areas of the Tegions and NR ls the num 
ber of regions, Similarly we can obtain 3 Single matrIx whose IOWwWS 3Te th 
bounding boxes of each reglon using the statement 


EXxtent 


EX 起 Pema' 


V= cat(1，D,BoundingBox ) ; 


This array is of dimension NR X 4. The cat operator is explained 
Section 6.1.1， 


FiLLedArea' 
FiLLedImage 


刘 , 妈 Texture 


An important approach for describing a region ls to quantify lts texture co 
tent. In this Section we illustrate the use of two new functions for computin 
texture based on statistical and Spectral mneasuUres. 


Image' 


Statistical ApProaches 


A frequentiy used approach for texture analysis is based on statistical PITOPEI 
ties of the intensity histogram. One class of such measures ls based on statisti 
cal moments. As discussed in Section 5.2.4, the expression for the mth momen 
about the mean ls given by 


Orientation， 


PiXe1lLiSt， 
Solidity， 


二 1 
= ( 丰 )D(3 


/一 (0 


'EqUILVDIametenr 


MajorAxisLength 


MinorAxiSsLength:， 


0 by function ee 


0 


The number of pixels in a reglon. 

1 X 4vector defining the Smallest rectangle contalning a reglon,.BoundingBox is 
defined by [ulL_corner width], where UL_corner ls In the form [x yl] and 
specifies the UppPer-left corner of the bounding box, and width ls In the form 
[x_width y_width] and specifies the width of the bounding box along each 
dimension.Note that the BoundingBox is aligned with the coordinate axes and, in 
that sense, is a Special case of the basic rectangjie discussed in Section 11.3.1. 

1 X 2 vector' the center of mass of the region. TIhe first element of Centraoid is 
the horizontal coordinate (or x-coordinate) of the center of mass, and the Second 
element is the vertical coordinate (or 六 Coordinate). 

Scajlar' the number of Pixels in “ConvexImage '， 

pP X 2matrix'; the smallest convyex polygon that can contain the region, Each row of 
the matrix contains the x- and y-coordinates of one of the P vertices of the polygon. 
Binary imagei the convex hull, with all Pixels within the hujll filled in (1.e., Set to 
on). (For Pixels that the boundary of the hull passes through, regionprops USeS 
the same logic as roipoly to determine whether the pixel is inside or outslde the 
hujll.) The image is the size of the bounding box of the reglon. 

Scalar' the eccentricity of the ellipse that has the Same second moments as the 
region.The eccentricity is the ratio of the distance between the foci of the eljipse 
and its major axis length.The value is between 0 and 1, with 0 and 1 being 
degenerate cases (an elljpse whose eccentricity is 0 is a circle, while an elipse with 
an eccentricity of 1 is a line segment)， 

Scalar' the diameter of a circle with the same area as the region. Computed as 
sqrt(4*Areay/7pI). 

Scalar; equal to the number of objects in the region minus the number of holes in 
those objects. 

Scalar; the pIopertion of the Pixels in the bounding box that are also in the 
region. Cormputed as Area divided by the area of the bounding box， 

8 X 2 matrix; the extremal points in the reglon. Each row of the matrix contalns 
the xz- and y-coordinates of one of the polints. The format of the vector is [top- 
Left，top-right，right-top，right-bottom，bottom-right，bottop- 
Left，1Left-bottom，1eft-top]. 

The number of on Plxels in FiL1edImage. 

Binary image of the same Size as the bounding box of the region. The on Pixels 
correspond to the region, with all hoies filled. 

Binary image of the same size as the bounding box of the region; the on PiXels 
correspond to the region,and all other pixejls are off. 

The length (in pixels) of the major axisy of the eliipse that has the same second 
moments as the reglon. 

The length (in pixels) of the minor axisft of the ellipse that has the same Second 
moments as the region. 

The angle (in degrees) between the x-axis and the major axis of the ellipse that 
has the Same second moments as the region， 

A matrix whose rows are the [X，y] coordinates of the actual pixels in the region. 
Scalar; the PIoportion of the Pixels in the convex hull that are also in the reglon. 
Compnuted as Area/ConvexArea， 


Note that the use of major and minor axis in this context is different from the major and minor axes of the basic rectansle dis- 
cussed in Section 11.3.1. For a discussion of moments of an ellipse,see Haralick and Shapiro [1992]. 
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TIABLE 11.2 
Some descrlptors 
of texture based 
on the intenslty 
histogram of a 
reglon， 
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TABLE 11.3 
Texture measures 
for the reglons 





Moment 了 xpressiof Measure ofTexture Average Average | -IThirdq 
Intensity Contrast  ， R ”Moment 





Uniformity -Entropy 























-1 
1 ee 人 2 人) 和 8702 117 ”0002 -0011 0.028 5.367 shown in 
119.93 73.89 0.078 2.074 0.005 7.842 Fig, 11.19. 
Standard deviation ao = VAa(z) = Vao” Ameasure of average contrast， 98.48 33.50 0.017 0.557 0.014 6.517 
Smoothness R=1-1/1+o) Measures the relative smoothness of 
the intensity in a region.Ris0Orfora: 
region of constant intensity and t = Statxture(f，Scale) Statxture 
0 





approaches 1 for regions with large 
eXxcursions in the values of its 
intensity levels. In Practice, the 
variance used in this Ineasure is 
normalized to the range [0,1j by 
dividing itby (了 一切 ” 






Where f js an input image (or Subimage) and t ls a 0-element TOw Vector whose 
0mponents are the descriptors in Table 11.2, arranged in the same order, Para- 
leter Scale also is a 6-element IOw vector, whose components multiply the cor- 
esponding elements of t for scaling purposes.If omitted, scale defaults to all 1s. 





The three regions outlined by the whijte boxes in Fig. 11.19 fepresent, from 王 XAMDPLE 11.10: 


ft to right, examples of smooth, coarse and periodic texture.The histograms 。 Statistical texture 
measSUTIeS. 


Third moment 1 = > (zi 一 mm)pP(zi) Measures the skewness of a histogram: 
This measure ls 0 for Syminetric 
histograms, positive by histograms 
skewed to the right (about the mean) 
and negative for histograms Skewed to 
the left.Values of thls measure are 
brought into a range of values 
comparable to the other five measures 
by dividing ua by (二 - 1 also, which 
is the same divisor we Used to 
normajlize the variance. 









fthese regions, obtained using function imhist, are Shown in Fig. 11.20. The 
ntries in Table 11.3 were obtained by appPlying function statxture to each 
fthe subimages in Fig. 11.19.These resujts are in geneIal agreement with tbe 
exture Content of the respective Subimages. For example, the entropy of the 
oarse Tegion [Fig. 11.19(b)] is higher than the others because the values of 
e pixels in that region are more random than the values in the other 












Uniformity 避 = > PD(z1) Measures uniformity.This measureis 
maaximum when all gray levels are 
equal (maximajlly uniform) and 

decreases from there. 



















Entropy e= 一 >》p(z)logzp(z) Ameasure of randomness. 
=0 


where zi is a random variable indicating intensity P(z) is the histogram of the 
intensity levels in a region, 志 is the number of possible intensity levels, and 


-1 
j1? 二 之 ZiD(2i 


is the mean (average) intensity. These moments can be computed with func 
tion statmoments discussed in Section 5.2.4.Table 11.2 lists some common de 
scriptors based on statistical moments and also on uniformity and entropy 
Keep in mind that the second moment, Az(z),is the varlance， Cr 

Writing an M-function to compute the texture measures in Jabjle 11.3 也 
straightforward. Function statxture, written for this Purpose, 1S included 普 


Appendix C.The syntax of this function 1S 





GURE 11.19 The subimages shown represent, from left to right, smooth，coarse, and periodic texture. 
了 hese are optical microscope images of a superconductor，human choiesterol，and a mlcTOPIOCeSSOT， 
(Original images courtesy of Dr. Michael W. Davidson, Florida State University.) 


468 ”Chapter 11 机 Representation and DescriPtion 

























































1800 350 1000 The results of these two equations constitute apair of values {S(r), (9jj for 
1600 300 900 zch pair of coordinates (~, 6). By varying these coordinates we can generate 
1400 250 wo 1-D functions, S(r) and $(6), that constitute a spectral-energy description 
1 200 000 texture for an entire image or region Under consideration. Furthermore, de- 
请 了 iptors of these functions themselves can be computed in order to character- 
600 100 300 ze their behavior quantitatively， Descriptors typicaly used for this Purpose are 
400 由 200 从 ie location of the highest value, the mean and varlance of both the amplitude 
6 nd axial variations, and the distance between the mean and the highest yajlue 
0 50 100 1S0 200 250 0 S0 100 150 200 230 0 5S0 100 13s0 200 the function. 


省 be 
FIGURE 11.20 Histograms corresponding to the subimages in Fig. 11.19. 将 


[fsrad，Sang，S] = Specxturet(Tf) 
regions. This also is true for the contrast and for the average intensity in tb 
case. On the other hand, this region js the least smooth and the least Unifo 

as revealed by the values of R and the uniformity measure, The histogram 
the coarse Tegion also shows the greatest jack of symmetry with respect tot 
location of the mean value, as is evyident in Fig. 11.20(b)j, and also by 也 


王 下 2 h istTj 
rssstvalte pihe iind iomeitahown 旋 Tuble113 igure (a)j shows an image with randomjly distributed objects and 


Tig, 11.22(b) shows an Image containing the same objects, but arranged perl- 
dically. The corresponding .Fourier Spectra，computed using function 
pecxture, are shown in Figs. 11.21(c) and (d). The Periodic bursts of ener- 
extending quadrilaterally im two dimensions in the Fourier Spectra 3aTe 
te to the periodic texture of the coarse background material on which the 
iatches rest, The other components of the spectra in Fig. 11.21(c) are clear- 
aused by the random orientation of the strong edges in Fig. 11.21(a). By 
trast, the main energy in Fig.11.21(d) not associated with the background 
Jong the horizontal axis，corresponding to the strong vertical edges 1n 
地 . 11.21(b). 
:Higures 11.22(a) and (b) are plots of 9(r) and (9) for the random matches， 
,and similarly in (c) and (d) for the ordered matches, all computed with function 
pecxture, Jhe plots were obtained with the commands plot(srad) and 
lot(sang).The axes in Figs.11.22(a) and (c) were scaled using 


Spectral Measures of IJexture 


Spectral measures of texture are based on the Fourier Spectrum， which is idea 
suited for describing the directionajity of periodic or almost periodic 2-D Patterns 
in an image.These global texture patterns, easily distinguishable as concentrations ss 
of high-energy bursts in the spectrum, generally are quite diffticujt to detect 
spatial methods because of the local nature of these technlques， Thus spectral te 
ture is useful for discriminating between periodic and nonperiodic texture P 
terns, and, further,for quantifying differences between periodic patterns. 
Interpretation of spectrum features is Simplified by expressing the SPe 
trum in Polar coordinates to yield a function Sr 6), where 5 ls the Spectrum 
function and r and 9 are the variables in this coordinate system. For each d 
rection b, S(r, 9) may be considered a 1-D function, Se(r). Similarly, for eac 
frequency r, Sr-(9) is a 1-D function, Analyzing Se(r) for a fixed value of 
yields the behavior of the spectrum (such as the Presence of Peaks) along aI 
dial direction from the origin, whereas analyzing 9-(9) for a fixed vajue of 
yields the behavior along a circle centered on the origin. 
A global description is obtained by integrating (summing for discrete VaIL 
ables) these functlons: 


> axlis([horzmin horzmax Vertmin Vertmax ] ) 


scussed in Section 3.3.1, with the maximum and minimum values obtalned 
om Fig. 11.22(a). 

TIhe plot of S(r) corresponding to the randomjly-arranged matches shows 
0 Strong periodic components (ie., there are no peaks in the Spectrum be- 
des the peak at the origin, which is the DC component). On the other hand， 
e plot of S(r) corresponding to the ordered matches shows a Strong peak 
earr = 19andasmaller onenearyr = 23. Simjiarly,the random nature of the 
过 energy bursts in Fig， 11.21(c) is quite apparent in the Plot of SU8) 
8.11.22(b),. By contrast, the Plot in Fig.11.22(d) shows Strong energy COmPO- 
ents in the reglon near the orlgin and at 90? and 180?. This is conslstent with 


e energy distribution in Fig. 11.21(d). 丝 


and 


2 
2 
| 
iM 
mn 
宇 


where Ruis the radius of a circle centered at the origln. 
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specxture 
ee 


卫 XAMDPLE 11.11: 
Computing 
Spectral teXture. 
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和 bb: 

Cd 

FIGURE 11.21 
(a) and (b) 
Images of 
unordered and 


ordered objects. 


(c) and (d) 
Corresponding 
Spectra. 
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和 
P9 上 0 
0T D,d9 = 0, 1,2,,..，where 
十 
已 全 二 了 
.4.3 Moment Invariants 
The 2-D rorment of order (P + 9) of adigital image F(x,y)is definedas  : 人 
5 Aset of seven 2-D oaet ivarianz that are insensltive to translation, scajle 


1771p4 一 六 六 
》 


二 


gi = T20 十 702 

fj = (720 一 702) 六 十 4701 

43 = (730 一 3912) 十 (3721 一 703) 
fa = (730 + 312) 十 (721 十 703) 


for D,9 = 0,1,2,..., where the Summations are over the values of the spatial 
coordinates Y and y Spanning the image, TIhe corresponding ce7tral OHEHL 1S: 
defined as 


pw 六 (r 下 一下 fx, 中 5 二 (730 一 TD) (an + 12)[(730 十 ID 
x 一 34721 十 003 六 | 十 《3721 一 303)m21 十 903) 
where [3(rmao + 7 六 一 (21 十 703 六 ] 
四 al bd6 = (720 一 Toz)[(730 + 7 六 一 (21 十 703 六 
人 十 4911(730 十 712 儿 721 十 7703) 
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本 

局 

FIGURE 11.22 
Plots of (a) S(r) 
and (bj (6) for 
the random 
image.(c) and (d) 
are piots of Sf(7) 
and (9) for the 
ordered Image， 








TtrrerUTDTIITTREETSNGT3Fisre enyeoinen 上 
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bc 

d 记 

FIGURE 11.23 
(a) OTiginal， 


筷 = (3m1 一 To3)(730 + 72)[(73ao0 二 Ti2) 
- 3(72i 十 703) ] + (37i 一 930)(72l + 703) 
[3(73o + 72 关 一 (721 十 03) 


An M-function for computing the moment inyariants，whlch we padded image， 
invmoments,is a direct implementation of these Seven equations, The Syntax (b) Half size 
image， 


as follows (see Appendix C for the code listing): (c] Mirrored 
C 1TTOTE 


image.(d) Image 


invmoments phi = invmoments(Tf ) rotated by 2。， 
Se (e) Image rotated 
where fis the input image and phi is a Sevelrelement ITOW Vector containin 4 .The zero 
the moment invariants just defined. padding in (a) 
through (d) was 


done to make the 





EXAMPLE 11.12: 吕 The image in Fig. 11.23(a) was obtained from an orTiginal of Size 400 X images consistentt 
Moment pixels by using the command in size with (e) for 
InVarlants. Viewing PUTPoSesS 
>> fp = padarray(f，[84 84]，'both'); Ca 
Zero padding was used to make all displayed images consistent with the 1 imag 
occupying the largest area (5368 X 568 pixels) which, as discussed below, 
Image rotated by 45". The padding is for display purposes only, and was no 
0 used in any moment computations. The half-size and corresponding RN 
0 images were obtained using the commands 
CO 让 DPed 
abodi ae Vertical >> fhs = fl(1:2:end，1:2:end) ; 
CoclS QH 
= flipud(A) >> fhsp = padarray(fThs，[184 184]， both ) 


Fefrns Amitp 纺 e 


rowsjiippedaoo The mirrored image was obtained using MATLAB function fLiplr: 
太 e AhoriZomial QTXE 


>> fn = flLiplLrf)i 
>> fmp = padarray(fm，[84 84]， both ) 





To rotate an image we use function imrotate: 


g = imrotate(f，ang1Le，method， "crop ) 





which rotates f by angle degrees in the counterclockwise direction. Parame- 
ter method can be one of the following: 





nearest' uses hearest nelghbor interpolation; 
:bilinear' uses bilinear interpolation (typically a good choice); and 
'bicubic' uses bicubic interpolation. 





The 1 size js increased automatically by padding to fit the rotation， 芷 

'crop' is included in the argument, the central Part of the rotated image 1 
cropped to the same size as the original. The defaujt ls to specify angje onjly, if 
which case 'nearest' interpolation is used and no cropping takes place， 
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TABLE 11.4 

The seven moment 
invyariants of the 
images in 

Flgs. 11.23(al) 
through (e).Note 
the use of the 
magnitude of the 
log in the first 
Column， 


Invariant (llogl) Original 。 Half Size ”Mirrored  。 Rotated 2。 Rotated 45 | 


由 6.600 6.600 6.600 6.600 6.600 

小 16.410 16.408 16.410 16.410 16.410 

人 23.972 23.958 23.972 23.978 23.973 

四 23.888 23.882 23.888 23.888 23.888 

由 49.200 49.258 49.200 49.200 49.198 

由 32.102 32.094 32.102 32.102 32.102 

由 47.953 47.933 47.850 47.953 47.954 < Image 





Tpe rotated images for OUI example were generated as follows: 了 
1H-dlimensional | 2 


colamn vector 


Image 2 






>> fr2 = imrotate(f，2， "bilinear ) ; 
>> fr2p = padarray(fr2，[76 76]， both ) 
>> fr45 = jimrotate(f，45，'bilinear'); 


Image 1 
























he Images are of size M X Nthere willbetotal of MN such nz-dimensional 
ectors Comprising all pixels in the 7 jimages. 

The mean vector mx, of a vector Population can be approximated by the 
ampjle average: 


Note that no padding was requlred in the last Image becatse lt ls the larg 
Image in the set. TIhe 0s im botB rotated 1mages werIe geneiated by IPT in tf 


Piocess of rotation. 
The seven moment Invarlants of the five Images just discussed were gen 


ated using the commands 


>> phiorlg = abs(log(invmoments(f) ) ) 1 
th 天 = AMN.Similarly, the 7  X 天 covariance matIrix Cx, of the population 


>> phihalf = abs(Jogl(invmoments(fhs))) 

>> phimirror = abs(1og(LInvmoments (fm) ) ) ; an be approximated by 

>> phirot2 = abs(1og(InvmomentSs(fr2) ) ) 1 有 

>> phirot45 = abs(LIogf(Invmoments(TFr45) ) ) 8 和 和 | JJ(xk 一 m_ )7 
一 上 KK=1 


Note that the absolute value of the 1og was used instead of the morment 这: 
variant values themselves. Use of the 1og reduces dynamic range, and the ab 
sojute value avoids having to deal with the complex numbers that Tesult whe 
computing the 1og of negative moment invarlants, Because interTest generI3 几 
lies on the inyvariance of the moments, and not on their Sign,use of the absojut 
value ls common Practice. 

The seven moments of the original, half-size, mirrored, and rotated image 
are SUummarized in Table 11.4. Note how close the numbers are, indicating- 
high degree of invariance to the changes just mentioned.Results like these ar 
the reason why moment invariants have been a basic staple in 1mage descrip 
tion for more than four decades. 


here 天 一 Tinstead of Kis used to obtain an unbiased estimate of C、from 
e Samples. Because Cx ls real and symmetric, findqing a set of mn orthonormal 
genvectors always 1$ possible. 

ThePprincipal comapomenn tras1forma (also called the 感 ole1ling tra13jforrmm) 1S 
glven by 


一 A(x 一 如 xj 


tis not difficult to show that the elements of vector y are uncorrelated.Thus， 
he covariance matrix Cy is diagonal， The rows of matrix A are the normajlized 
elgenvectors Of C,. 了 eeaiise Lx ls ieal and Symimetric, these vectors form an 
orthonormajl set， id it follows that the elements along the main diagonal of 





Using Principal Components for Description y are the eigenvalues of Cx. The main diagonal element in the ith Tow of C， 
证 Sthe variance of vector element 

Suppose that We have 1 ITegistered images，Stacked Im the aITangemen Because the rows of A are orthonormal its inverse equals its transPpose. 

shown in Fig. 11.24. There areapixels for any given Pair of coordinates (月 Jhus, we can recover the x's by performing the inverse transformation 

one Pixel at that location for each image. These Pixels mlay be arTranged in 也 

torm of a column vector ,了 X 三 人 Ay 二 In 
X1 The Importance of the Principal components transform becomes evldent when 
7 二 Only g eigenvectors ae Used， In which Case 人 becomes ag X 严 matrix, A，. 

X 一 


= Ayy 十 Im 
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FIGURE 11.24 
Forming a vector 
from 
corTesponding 
Pixeis i Iastack of 
images of the 
Same Size. 
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Imstack2vectors 
En 


COVma 上 PrEX 
上 


the xs ls given by the expresslon 


六 G 
ms 二 > 了 


| 


J=d+l 


The first bne of thlis equation indicates that the error is zero 让 4 = 7 (that is, 这 ia 
the elgenvectors are used in the inverse transformation). This equation al 
Shows that the erTor can be minimjized by Selecting for Ay the dg eigenvectors 38 
SOclated with the largest elgenvalues. Ihus,the Principal components transformii 
optimal in the sense that lt minimizes the mean Square error between the vectot 
x and their approximations X. The transform Oowes its name to using the eigen 
vectors corresponding to the largest (PITincipal) eigenvalues of the covarialic 
ImatriX, [he example given later in this Section further clarifies this concePpt. 

Asetofm registered images (each of size M X Ni)isconvertedtoa stack 
the form shown in Fig. 11.24 by using the command: 
>> 9 


= Dat(3， 下 17 27 下 人 7 









nction P = 
RINCOMP Obtain principal-Component vectors and related quantities， 


1]1.3 咽 Using Principal Components for Description 


== 1 55Handle special case， 


Compute an Unblased estimate of 由 ， 
= SuUm(X，1)7K; 
SUDtEaEt the mean from each row of X， 
= 欠 一 和 (Ones(K，1)，:); 
Compute an unbiased estimate of C，Note that the product is 
X *X because the vectors are rowSs of X， 
(X *X)A(K 一 1); 
m ;点 Convert to a_ Column vector， 


计 f 


princomp(X，9) 


= PRINCOMP(X，Q) Computes the principal-component vectors of 


the Vector population contained in the rows of X，a matrix of 


















S1Zze K-by-n Where K is the humber of vectors and n is their 
dimensionality， Q，with Values in the range [0，n]，is the number 
of eigenvectors used in constructing the principal-components 
transformation matrix，P is a structure With the foLLowing 
fields， 


This image stack array which is of Size M X N X mu is converted to an aITdi 
whose rows are 1-dimensional Vectors by Using function imstack2vectors (Set 
Appendix C for the code), which has the syntax 


In PP.Y K-by-Q matrix Whose Columns are the principal- 
2 component Vectors. 
where S is the image Stack and X is the array of vectors extracted from S usinl P.A Q-by-n principal componentS transformation matrix 


Whose rows are the Q eigenvectors of Cx Corresponding 
to the Q largest eigenvalues . 


the approach shown in Fig. 11.24. Input MASK is an AM X N logical or numeri 
image with nonzero elements in the locations where elements of S are to be 
used in forming X and 0s in locations to be ignored. For example, 这 we wanted 
to use only vectors In the right, upper quadrant of the images in the stack,then 
MASK woujd contain ls in that quadrant and 0s elsewhere. If MASK is not ip: 


from the principal-component vectors. P,X and P.Y are 
identical if Q = 





cluded in the argument, then all image locations are used in forming X. Finalky .ems The mean Square error incurred in using only the Q 
elIgenvectors Corresponding to the Largest 
parameter R ls an arIray whose rows are the 2-D coordinates coTTesponding to 6i0bjiuaiiEsi aiesigs0 de 省 二 
the location of the vectors sed to form X. We show how to use MASK in Exam- P_ cx The n-by-n covariance matrix of the population in X. 
ple 12.2. In the present discussion we use the default. P_mx The n-by-1 mean vector of the population in X。 
The following M-function, covmatrix, computes the mean vector and co- P.Cy The Q-by-Q covariance matrix of the population in 


variance matrix of the vectors in X. Y，The main diagonal contains the eigenvalues (in 
function [fC，ml = COovmatrix(X) 

%COVMATRIX Computes the covariance matrix of a vector population. 
[CC，M] = COVMATRIX(X) Computes the Covariance matrFix C and the 
mean Vector M of a vector population organized as the rows of 
matrix X.C is of Sijze N-by-N and M is of Size N-by-1，where N is: 
the dimension of the vectors (the number of coLlumns of X)， 


和 
[K，nj = Size(X); 
X = double(X); 


[k n] = size(X); 
X = double(X) ; 
% 0btain the mean vector and covariance matrix of the vectors in X， 


CCX，P.mx] = Covmatrix(X); 
= P.mx ; % Convert mean Vector to 8a row vector. 


ob oR 号 


0btain the eigenvectors and corresponding eigenvalues of CXx，The 
和 genvectors are the Columns of n-by-n matrix VD is an n-by-n 


P .xx K-by-n matrix whose rows are the vectors reconstructed 


descending order) Corresponding to the Q eligenvectors， 
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[V，0] = eigt(A) 
refrns 上 e egEHYEC- 
Or3 OFAas 1Pe 
COULIHS OF QTEX V， 
2md 切 e correspord- 
Dag etigeFVQiLE5S 
cjong 共 e 1aain diag- 
onal of diagomal /ma- 
trix D. 


下 XAMPLE I,13: 
Princlpal 
components. 
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% diagonal matrix whose elementSs al0ng the main diagonal are the 
% eligenvalues corresponding to the eligenvectors in V，SoO that Xx*V = 
朱 DxV . 

[V，D] = elig(P.CX) 


% Sort the eigenvalues in decreasing order，Rearrange the 
% eigenvectors to match， 

d = diag(Di) ; 

[d，idx] = Sort(d) ; 

d = flipud(d); 

idx = flipud(idx) ; 


D = diag(d) 
V=V(I:，1LdX) 
% Now form the q rows of A from first q Columns of V， 


P.A=V(:，1:9) | 


% Compute the principal Component Vectors， 
Mx = repmat{P.mx，K，1); 和 M-by-n matrIixX， Each Prow = P.mX， 
= P.Ar(X ~ Mx) ;5%5 q-by-K matrix， 


btain the Feconstructed Vectors， 
(P.A'*P.Y) + MX 


The mean square error is given by the Sun of 311 the 
eigenvalues minus the Sum of the 9 largest eigenvalLues， 
= diag(D) ; 


suml(d(qg+ 1:end) ) ; 


Covariance matrix of the YY S: 
.CyY = P,A*P.CX*P,A |; 


了 oo 了 卫 口 愉 吧 
《D 
E 
(六 
所 


0 


Figure 11.25 shows six satellite images of SIZe 512 X 512, corresponding to 
six Spectral bands: visible blue (450-520 nmj), VSlble green (520-600 nmy), visL- 
ble red (630-690 nm)j, near infrared 〈760-900 nmj, middle infrared (1550- 
1750 nmyj, and thermal infrared (10,400-12,500 nm), The objective of thls ex 
ample is to illustrate the use of function princomp for principal-componen 合 
work, The first step is to organize the elements of the Six Images in a stack of 
size 512 X $12 X 6, as discussed earjlier: 


>> S = cat(3，f1，f2，f3，f4， f5 TB6) 


where the f's correspond to the six multispectral images Just discussed, Ihen 
we organize the Stack Into aITaY X: 


>> [fX，R] = imstack2vectorstS) 


Next we obtain the six principal-component images by usingq = 6 in func- 


tion princomp: 
二 


>> P = princomp(X，6) |; 
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FIGURE 11.25 
SIXx multispectral 
images in the 

(a) visible blue， 
(b) visible green， 
(cj visible red， 
(d) near Infrared， 
(e) middje 
infrared, and 

(f) thermal 
infrared bands. 
(Images coUItesy 
of NASA.) 
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The other five images are obtained and displayed in the same manner. The: 
eigenvalues are along the main diagonal of P,Cy,SO We use 


>>d= diag(P.CY)， 


where dis a 6-dimensional column vector because we usedgqg = 6linthe function 

Figure 11.26 shows the six principal-component images just computed.Th 
most obvious feature is that a significant portion of the contrast detaill is con 
tained in the first two images, and it decreases Tapidly from there. [he reaso 
is easily explained by lookling at the elgenvalues， As Table 11.3 shows, the firs 
two elgenvalues are quilte large in comparison with the others. Because 蕊 
ejgenvalues are the variances of the elements of the y Vectors, and Variance ij 
a imeastre of contrast, it is not Unexpected that the images corresponding {t 
the dominant eigenvalues would exhibit significantly higher contrast， 

Suppose that we use a smajler value of q,Say q = 2.Ihen reconstruction i 13. 
based only on two Principal component images. Using 


>> P = pPincomp(X，2) 
and statements of the form 


展区 寺 让 
reshape(th1，51412，512) |; 


>> hh1 
>> hi 


for each image resulted ip the reconstructed images in Fig， 11.27, Visually 
these 1 images aTe quite close to the originals in Fig. 11.25. In fact, even the dif- 
ference images show little degradation. For instance, to compare the original 

and reconstructed band 1 images, we write 。 


>> D1 double(f1) - double(h1); 
>> D1 gsScale(D1) ; 
>> Imshow(D1 ) 


扫 


Figure 11.28(a) shows the result.The low contrast in this image is an indlca- 
tion that little visual data was lost when only two Principal component images 
were Used to ITeconstruct the original 1mage. Figure 11.28(b) shows the differ- 
ence of the band 6 images. The difference here is more Pronounced because 
the original band 6 image is actually blurry. But the two Principal-component 
images used in tbhe reconstruction are sharp, and they have the Strongest infiu- 
ence on the reconstruction. The mean square error incurred in using only fwo . 
Principal component Images ls given by 


P ,ems 
ans = 
1.7311e+003 


103352 2959 


which is the sum of the four smaller eigenvalues in Jable 11.3， 
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FIGURE 11.26 
Principal- 
component 
Images 
coOrTeSponding to 
the Images in 
Fig. 11.23. 


TABLE 11.5 
Eligenvalues of 
P,CY whengq = 
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:ab 

区 

:二 

FIGURE 11.27 
Mujltispectral 
lmages 
TeconstTucted 
using onjy the two 
Principal- 
comPonent 
images with the 
largest variance， 
Compare with the 
oOIiginals in 

本 2 
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Before leaving this section we Point out that function princomp can be used to 
gn objects (regions or boundaries) with the elgenvectorS of the objects. The co- 
ordinates of the objects are arranged as the columns of x, and we USe 9q = <. Thbe 
transformed data, aligned with the eigenvectors, js contained In P.Y， Thls 1s a 
xupgged alignment procedure that uses all coordinates to compnute the transfor- 
mation matrix and aligns the data in the directlon of its principal spread. 


TITTTLO7 TV 

e Tepresentation of objects or regions that have been segmented out of an image 1S 
an early step in the pTeparation of image data for subsequent usSe in automation. For 
sexample, descriptors such as the ones just covered constitute the input to the object 
recoghition algorithms developed in the next chapter. The M-functions developed in 
fhe preceding sections of this chapter are a significant extension to the PoweT of Stan- 

ard IPT functions for image representation and description. It 18 undoubtedly clear by 
now that the choice of one type of descriptor over another is dictated to a ljarIge degree 
“by the problem at hand. This is one of the Principal reasons why the solution of Image 

processing problems is aided significantly by having 3 flexibie prototyping environ- 
ient in which existing functions can be integrated with new code to gain flexibility and 
educe development time.The material in this chapter is a good example of hbow to con- 


struct the basis for Such an enviIOnImEent， 
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3 .bb. 


FIGURE 11.28 

(aj) Difterence 
between 

Figs, 11.27(a) and 
11.25(3). 

(b) Diftfterence 
between 

Figs. 11.27(f) and 
11.23( 提 .Both 
images are Scaled 
to the full [0, 255] 
8-bit intenslty 
Scale， 
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yhere each component,*, represents the ith descriptor and mis the total num- 
ier of Such descrmiptors associated with the pattern. Sometimes it is neceSssary 
in comPputations to use vectors of dimenslon 1 X 7 obtained Simply by 
orming the transpose， x7 ,of the Preceding column vector 

:The nature of the components of a Pattern vector x depends on the ap- 
TOach used to descrlbe the physical pattern itself For example, consider the 
rroblem of automatically classifying alphanumeric characters. Descriptors 
uUitabje for a dectslon-theoretic approach might include measures Such 2-D 
moment Inyariants Or a Set of Fourier coefficients describing the outer bound- 
TIy of the characters， 

In Some applications, pattern characteristics are best described by structur- 
1relationships. For example, fingerprint recognition is based on the interrela- 
onships of print features called maizutiae. Together with their relative sizes 
nd locations, these features are primitive components that describe finger- 
rint ridge properties, such as abrupt endings, branchinpg， merging， and discon- 
ected Segments，Recognition Problems of this type，ina which not only 
uantitative measures about each feature but also the spatial relationships be- 
Ween the features determine class membership, generally are best solved by 
ructural apPproaches. 

The material In the following sections is representative of techniques for 
plementing pattern recognition solutions in MATLAB. A basic concept 训 n 
cognition, especially in decision-theoretic applications, is the idea of pattern 
atching based on measures of distance between pattern vectors. Therefore， 

We begin the discussion with various approaches for the efficient comPputation 
of distance meastures in MAITILAB， 





JP7eviez0 


We conciude the book with a discussion and development of several M-functions 
for region and/or boundary recognltion, which in this chapter we call opjecty'of 
Paliers. Approaches to comiputerlzed pattern recognition may be divided into 
two PrIincipal areas: decision-theoretic and structural. The first category deal 
with patterns described using quantitative descriptors, such as length, area, texX- 
ture,and many of the other descriptors discussed in Chapter 11.The second cate- 
goryY deals with patterns best represented by Symbolic information，such a8 
strings, and described by the properties and relationships between those symbols 
as explained in Section 12.4. Central to the theme of recognition is the concept of 
“learing "from sample patterns.Learning techniques for both decision-theoretic 
and structural approaches are implemented and illustrated in the material that 
foljiows. 





有 Computing Distance Measures in MATLAB 


The material in this section deals with vectorizing distance computations that 
otherwise would involve for of while loops. Some of the vectorized expres- 
sions pTesented here are considerably more Subtje than most of the examples 
in the previous chapters, so the reader is encouraged to study them in detail 
Jhe following formulations are based on a summary of similar expressions 
compiled by Acklam [2002|]. 

The 尼 zciideanz distace between two mn-dimensional (Tow or column) vec- 
tors x and yls defined as the scalar 


dx 人 = 相 一 列 = 李 一 丰 =[Oea 一 证 (0 


This expression is simply the norm of the difference between the two vectors， 
0 we compnute tt usimnmg MATLABSS fanction norm: 





Background 


A Pattem 18 an arra1z8ge1itenl OF descripiors，Such as those discussed 这 
Chapter 11.The name jaturejls used often in the pattern recognition literature - 
to denote a descriptor.A Pattern cliassy is a family of patterns that share aset of 
common PIOPerties. Pattern classes are denoted ol, wz, ,ww， where 研 1 
the number of classes. Pattern recognition by imachine involves techniques for 
asSigning Patterns to their respective classes 一 automatically and with as little 
human intervention as possible. 

The two Principal pattern arrangements used in practice are vectors (for 
quantitative descriptions) and strings (for structural descriptions)，Pattern 
vectors are Tepresented by bold lowercase letters, such as X, y, and z,and have 
the X 1vector form d= norm(x 一 Y) 2 
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Te jnverse matrix operation js the most time-consuming computational task 
equired to implement the Mahalanobls distance. This operation can be Optl- 
imized significantly by using MAILABs matrix fight division operator (/) in- 
duced in Table 2.4 (see also the margin note in the following page) 
ressions for my and C, are given in Section 11.2， 
Let X denote a Population of p, ndimenslonal vectors， and let Y denote 
popujation of 9,m-dimensional Vectors， such that the vectors in botn 入 and 
re the Tows of these arrays.The objective of the folowing M-function 1S 
comptute the Mahalanobis distance between everIy vector in Y and the 


eam, na:: 


where x and y are vectors corresponding toXandyin the Preceding equatig 
for d(x, y). Often, it is necessary to compnute a Set of Euclldean distances bs 
tween a vector y and each member of a vector Popziatiom consisting of p 7 
dimensional Vectors arranged as the Tows of a 己 XP matrx 及 ,For 
dimensions to line up Properly, y has to be of dimenslon 上 X .Then the (4 
tance betweeny and each element of 和 IS contained in the 避 XXX]1lyvector 


"与 


d=Sqrt(Ssum(abs(X 一 Prepmat(y，p，1)) ,2，2)) 


where d(i) is the Euclidean distance between y and the ;th row of Xi 
X(i，:)].Note the use of function repmat to duplicate TOW vector yp time 
and thus formap Xx 天 matrix to match the dmenslions of 入 .JIhe last 2 on 
Tight of the preceding line of code indicates that Sum ls to Operate along 本 
mension 2; that is, to sum the elements along the horizontal dimension. 

Suppose next that we have two vector Populations 买 , of dimenslon P X: 淆 
andY,of dimension g X .The matrix containing the distances between ToWRS 
of these two Populations can be obtained using the expression 





inction d = mahalanobis(varargin) mahalanobis 
MAHALANOBIS Computes the Mahalanobis distancCe， Re 

D = MAHALANOBIS(Y，X) computes the Mahalanobis diSstance between 

each vector in Y to the mean (centroid) of the vectors in X，and 

outputs the result in Vector D，Wwhose length is Size(Y， 1)， The 

vectors in X andY are assuned to be organized as rowSs，The 

input data can be real of complex， The Outputs are real 

duantit1ies， 


D = Sqrt(sSsum(abs(repmat(permute(X，[132])，[1q 11) ,..， 
- repmat(permute(l(Y，[3 1 2j)，[p11])).^*2，3) 


D = MAHALANOBIS(Y，CX，MX) computes the Mahalanobis distance 
between each vector in Y and the given mean vector，MX，The 
Fesults are output in vector D，whose length 1S Slze(Y，1)， The 
vectors in Y are assumed to be organized as the rows 0f thiSs 
array，The jinput data can be real or Complex，The Outputs are 
real quantities，In addition to the mean Vector MX，the 
covariance matrix CX of a populLation of vectors X als0o must be 
provided，Use function COVMATRIX (Section 11.5) TO Compute MX and 


CX， 


Reference: Acklam，P，J，{2002] "MATLAB Array Manipulation TiLpSs 


and Tricks." Available at 
home ,online,no/-pjack1Lam/mat1ap/doc/mtt/index.html 


oO 六 at 
wwW,prenhal1l.com/gonzalezWoodseddins 
param = varargin; % Keep in mind that param isS a Cell array， 


= paranmf1}; 
= SizelY，1); % Number of Vectors In Y， 


where D(i,j) lthe Euclidean distance between the lth and jth rows of 二 
Populations; that is, the distance between X(1l,:) andY(j,，;). 
The syntax for function permute in the Preceding exXPIession 1S 


B = permute(A，order) 





This function reorders the dimensions of A accaording to the elements of the vecto 
order (the eliements of this vector must be unique). For example, 过 Aisa2 
array, the statement B = permute(A,[2 1]) simply imterchanges the TOWS an 
columns of A,which is equivailent to letting B equal the transpose of A. 革 the jengt 
of vector order is greater than the number of dimensions of A,MAITLAB 
Processes the components of the vector from jleft to right, untll all eljements afe; 
used. In the Preceding expression for D,permute(X，[1 3 2]) creates 3ITayS 
the thjrd dimension, each being a column (dimension 1) of X. Since there aTe 
columns in X, 关 Such aiTrays are created, with each array being of dimension 六 头 
Therefore, the command permute(X，[1 3 2]) creates an array of dimensio 
PPX1XDPSimilarly the command permute(Y， [3 1 2]) creates an array of dir. 


] X gg X 天 .Pi 
mensionl X g X mn.Finally the command repmat(permute(X, [1 32])，[1i f length(param) == 


1}) duplicates 9g times each of the nm columns produced by the permute function, X = param{2]， 
thus creating an 和 of dimension P X 9 X . Similar comments hold for the % Compute the mean Vector and covariance matrix 0f the VectorS 
other command involving Y. Basicajlly the preceding expression for D js Simply a， 5% inxX， 


vectorization of the expressions that would be written Using for Or While loops- 

In addition to the expressions just discussed, we use in this chapter a dis- 
tance measure from a vector y to the mean my of a vector population, weight- 
ed inversely by the covariance matrix，C.，of the population. This metnc， 
called the Mapalapnopr drstaepnceis defined as 


d(y,mx) = (y - mx) Cx (y 一 mx) 


fCx，mx] = COvmatriX(X) 
elSeif length(param) == 3 
Cx = paranm{21l; 
-fx = param{t3l 
-81S8 
erfror('Wrong number of inputS，) 


% Cov. matrix and mean Vector proviLded , 


end 
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ourier descrliptors. The value of the ith descriptor becomes the value of xi， 
he ith component of a pattern vector. In addition,， we couljd appPend other 
omPponents to pattern Vectors. For Instance, we coujld Incorporate SIX addi- 
onal components to the Fourler-descrliptor by appending to each vector the 
X meastres of texture in Table 11.2. 

Another approach used qulte frequently when dealing with (registered) 
ultispectral images ls to stack the Images and then form Vectors from corTe- 
ponding pixels im the images, as jllustrated in Fig. 11.24. JIhe images are 
acked by Using funpctlon Cat: 


mx = mxf(:) ;5% Make Sure that mx 1S a row Vector， 


% Subtract the mean Vector from each vector In Y， 
YC=Y--nmx(ones(tny，1)，:); 


%5 Compute the Mahalanobls distances， 
d = real(Sum(YC/ACx,*conj(Yc)，2) ) ; 


The call to real in the last line of code is to remove “numeric nOlSe,”8S 
































S = cat(3，f1，f2，...，fn) 


here Sls the stack andf1,f2，,..，fnarethe Images from whlch the stack 1S 
formed. The vectors then are generated by using function imstack2vectors 
iscussed in Section 11.9. See Example 12.2 for an iustration， 


Decision-theoretic approaches to recognition are based on the Use of decision 、 
(also called discripazinazmb jactio1s. Let X = (X1，X2，.， .2Xn) Tepresent a 
1-dimensional pattern vector, as discussed ln Section 12.. 于 厂 pattern class: 
es ol, oz, ,. ,wwythe basic Problem in decislon-theoretic Pattern Tecognition 
is to find 琴 decision functions di(x), Gd2(X)，..，Gw(X) with the Property thati 
a pattern X belongs to class mi, then 


由 :了 


In other words, an Unknown pattern xX is Sald to belong to the ith Pattern ClaSS 
认 upon substitution of x into all decision functions, di(X) yields the largest nt 
merical value. Ties are resolved arbitrarlly. 

The decisiomr porndary separating class wi from wj is given by Values of x for 
which di(x) = 起 (xX) or equivalently, by values of x for which 


dx) 一 dx = 0 


Cormmon practice is to express the decision boundary between two classes by 
the single function dij(x) = CCX) 一 dx) = 0.Thus ditz) > 0for Patterns 
class wi and dij(x) < 0for patterns of class 中， 

As will 0 clear in the following sections, finding decision functions 
entails estimating parameters from patterns that are Tepresentative of tbe 
classes of interest. Patterns used for parameter estimation are called trainaing 
patterns, or 1rainingser.Sets of patterns of known classes that are not used for 
training but are used instead to test the performance of a Particujar recognr 
tion apProachb are referred to as lesf or idepedezr patterns OF Sets. The prin- 
cipal objective of Sections 12.3.2 and 12.3.4 ls to develop varlous approaches 
for finding decision functions via the use of parameter estimation 人 fom train- 
ing sets. Section 12.3.3 deals with matching by correlation, an approach that . 
could be expressed in the form of decision functions but is traditionally Pre-” 
Sented in the form of direct Image matching Instead. 


2.3.2 Pattern Matching Using Minimum-Distance Classifiers 


SUPpose that each pattern class, oj 18 characterized by a mean Vector muj .That 
ds we Use the mean vector of each population of trainjing vectors as being rep- 
Tesentative of that class of vectors: 


mj= 羡 双 x 7 = 1 2,..., 厂 


Aixso 


Where Ni ls the number of training pattern Vectors from class wj and the Sum- 
mation j 和 taken over these Vectors. As before, Wis the number of pattern class- 
es. One way to determine the class membership of an UKzaowm pattern Vector 
Xis to assign it to the class of its closest prototype. Using the Euclidean dis- 
stance as 3a Imeasure of closeness (i.e., Simjlarity) reduces the problem to com- 
puting the distance measures: 


Di(x) = |z 一 mi 2 


We then assign x to class wj 让 Di(x) is the smallest distance. That is, the small- 
Seest distance Implies the best match in thlis formulation， 

， Suppose that all the mean vectors are organized as IOows of a matrix ML. 
AIhen computing the distances from an arbitrary patternx to all the mean vec- 
tors is accomplished by using the expresslon discussed in Section 12.2: 


d= sqrtf(sSum(abs(M 一 repmat(X，WW，1)). 2，2)) 


Because all distances are positive, this statement can be simplified by ignoring 
he sqrt operation. The minimum of d determines the class membership of 
pattern vector X: 


弛 


Forming Pattern Vectors 


As noted at the beginning of this chapter, pattern vectors can be formed 人 fom 
quantitative descriptors, such as those discussed in Chapter 11 for regions 
and/or boundaries. For example,suppose that we describe aboundary by using : 


>> Class = find(d == min(d) ); 


n other words, if the minimum of d is in its Kth Position (Le., X belongs to the 
kth pattern class), then scalar Class will equal k. If more than one minimum 
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other Words, spatial correlation can be obtained as the inverse Fourler 
ftransform of the Product of the transform of one function tmes the conjugate 
bfthe transform of the other. Conversely, it fojlows that 


jie Ja 人 05 


This second aspect of the correlation theorem is included for compjeteness. 
inot used in this chapter. 

Implementation of the first correlation result in the form of an M-function 
is straightforward, as the following code shows. 


exists, class would equal a vector, With each of its elements polinting to a di 
ferent location of the minimunn， 和 

If instead of a single Pattern, we have aset of patterns arranged as the Iow 
of a matrix, X, then we use an expression similar to the longer exXpression if 
Section 12.2 to obtain a mmatrix D, whose element D(I, J) ls the Euclidean dis 
tance between the ith pattern vector in X and the jth mean Vector in M.Thus't' 
find the class mnembership of say the ith pattern in X, we fnd the coiumn loca 
tion in rowiofpDthat yields the smajllest value,.Multiple minima yield maultipl 
values, as in the single-vector case discussed in the last paragraph. 人 

It is not difficult to show that selecting the smajlest distance is equivalentt' 


evaluating the functions function g = dftcorr(T，W dftcorr 


5DFTCORR 2-D correlation jin the frequency domain， ne 
5 6G= DFTCORR(F，W) perfornms the correlation of a mask， 骨 ，With 

image F. The output，G，is the correlation image，of ClLaSS 

double，The output is of the Same Size as F， When，aS is 

generally true in practice，the mask image 1IS much smaller than 

G，wraparound error is negligible 寺内 is padded to SizetFf)， 


N] = Size(Tf) ; 

下 下 二 开 二 汪 

= Conj (fft2(W，M，N) ) 
reall(ifft2(w.*fT) ) ; 


and assigning x to cjass wj 让 di(x) yields the Largest numerlical value. This for 
mulation agrees with the concept of a decision function defined earjler. 

The decision boundary between classes wi and wj for a minimum distanc 
classlfier 1S 





dijix) = QiX) 一 CS AN 

Figure 12.1(a) shows an image of Hurricane Andrew, in which the eye of 了 EXAMPLE 12. 
he storm is clearly visible. As an example of correlation, we wish to find the Using cOITelation 
aocation of the best match in (a) of the eye imapge in Fig. 12.1(b). The image 1S fior 本人 人 

of sjize 912 X 912 pixejs; the mask is of size 32 X 32 pixels. Figure 12.1(c) is the We 

Tesujt of the following commands: 


本 了 (mi 一 mi) (mr+m)=0 
The surface given by this equation is the perpendicular blsector of the jne Seg 
ment joining mi and mj. For 7 = 2, the perpendicujar bisector is a line, fo 
六 一 3itisaplane,andforz >3itis called apyperpiane. 
>> g = dftcorr(fT，W) 
>> gs = gsScale(g) 1 


12.3.3 Matching by Correlation 
>> imshow(gs ) 


Correlation is quite simple in brinciple. Given an image (xz, y),， the correja-， 
tion problem is to find all Places in the image that match 3a gIVen Sublmage- 
(also calied a jaasK or temaplate) W(z J 作 .Typically (xc y) is much smailler 
than F(xz, 由 .One approach for finding matches is to treat W(X, 》) as 3 Spatial 
filter and compute the sum of products (or a normajlized version of it) for ea3ch: 
location of 由 in 六 in exactly the same manner explained in wectlon 3.4.1.Thern: 
the best match (matches) of w(xz, y) in jz, y) is (are) the location(s) of tbe- 
maximum value(s) in the resulting correlation image. Unless W(xX, y) 1 smaj,， 
the approach just described generally becomes computationally intenslve. For 
this reason, practical imnplementations of spatial correlation typPlcally rely on 


The blurring evident in the correlation image of Fig. 12.1(c) should not be a 
urprise because the image in 12.1(b) has two domipnant, nearly constant Te- 
ions, and thus behaves Simailarly to a lowpass filter. 

The feature of interest is the location of the best match, which, for correla- 
ion, implies finding the location(s) of the highest vaiue in the correlation 


,JU] = find(g == max(g(:))) 


hardware-oriented solutions. 554 
For prototyping, an alternative approach is to implement correlation 1n te 
frequency domain, making use of the correjlation theorem, which, like the con- 
203 


volution theorem discussed in Chapter 4, relates spatial correlation to the 
product of the image transforms. Letting“。”denote correjation and he 
complex conjugate, the correlation theorem States that 


FUx, yeW 人 (2 人) 全 下 (2) 瑟 TD 


n this case the highest value is unique.As explained in Section 3.4.1, the coor- 
dinates of tbe correlation image correspond to displacements of the template， 
o coordinates [IT, J] correspond to the location of the bottom, jefit COfrneT Of 
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#b where PLX/owi) is the probability density function (PDF) of the pattern vectors 
Cd of class wj, and P(wi) is the probabijity (a scalar) that class wj OcCUTITS. As be- 
FIGURE 12.1 fore, given an unknown pattern Vector the process js to compute a total of 环 
(a) Mujtispectral decislon functions and then assign the pattern to the class whose decision 
image of function yielded the largest numerical value Ties are resolved arbitrarily 

The case when the probability density functions are (or are assumed to be) 
(b) Template, Gausslan 1S of particular practical interest The 5-dimensional Gaussian PDF 
(c) Correlation of has the form 

Image and 

as D(x/w)) = 二 -5[(x-m)7c7(Gx-m 

(d) Location of ! 


人 C 
_ the best match (2 和 2 人 
(Original image 
COUrtesy of 


Where Cr and m / are 外 e covariance matrix and mean vector of the pattern 
NOAA-) 


.Population of class wj, and |Cj| is the determinant of C 交 
Because the lopgarithm is a monotonically increasing function choosing the 


Igest di(X) to classify patterns js equivalent to choosing the largest in [di(x)]， 
So we can use instead decision functions of the form 


Ci(X) 人 InlP(xWwj)P(owj)] 

= Powi) + ln 已 (oj 

where the logarithm is guaranteed to be real because P(X/ wjand P(wr) are non- 
negative, Substituting the expresslion for the Gaussian PDF gives the equation 


7 工 | 
di(X) = ] Poj) 一 也 27T 一 了 lnjCi 一 了 [xs 一 mi) Cri(x 一 ID) 


Jhe term (m/2) ln 27 is the same Positive constant for all ciasses, so it can be 
eleted, yielding the decision functions 










1 1 
dg = 了 P(w) -inc 一 MKxmrcclr 
5ee li8.3 了 74jorm the template. If the template were So located on top of the Image, we would 人 网 2 miCi 2 [一 四) 1 


etPiaaton of ohe find that the template aligns quite closely with the eye of the hurricane at 下 
ER for /= 1 2 了 .The term inside 也 brackets is re zed - 
人 those coordinates. Another approach for finding the locations of the matches / The term insi 站 k ”Tcognlized as the Maha 


全 janobis distance discussed in 9ection 12.2,for which we have a vectori dimple- 
ls to threshold the correlation inage near lts InaxImum, or threshold its scaled SC RSeC ID| 


Verslon, gs, whose highest value is known to be 255. For example, the image in 


COVari .3, So i i i 
5 [六 作 区 Ci vallance matrix fom Section 11.5, so Implementing the Bayes classifier for the 


multivariate Gaussian case is Straightforward, as the following function shows. 
>> imshow(gs > 254) 基 function d = bayesgauss(X，CA，NMA，P) bayesgauss 


加 ， : SBAYESGAUSS Bayes classifier for Gaussian patterns， 
Algning the bottom, left corner of the tempjlate with the Small white dot jp 


D = BAYESGAUSS(X， CA，MA，P) computes the Bayes decisijon 
Tig， 12.1(d) again reveals that the best match is Dear the eye of the functions of the patterns in the rows of array X Using the 
hurricane， 明 covariance matrices and and mean Vectors provided in the arfpFayS 
人 CA and 愉 A.，CA is an array of Size n-by-n-by-W，where n js the 


:4 Optimum Statistical Classifiers 


The well-known Bayes classifier for a 0-1 loss function (Gonzajez and Woods 
[2002]) has decision functions of the form 


di(x) = P(xWoj)P(w) = 1.2 .了 


dimensionality of the patterns and W is the_ number of 
classesS，Array MA is of dimension n-by-W (I,e.，the columns of 人 MA 
are the individual mean Vectors)，The location of the CoOVvariance 
matrices and the mean Vectors in their respective aprrays must 
correspond，There must be a covarlance matrix and a mean Vector 


op op op op 加 op 加 oo 
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for each pattern Class，even if Some of the covariance matrices 
and/or mean vectors are equal, X is an array 0f S1z6 K-by-n， 
where K is the total number of patterns to be Classified (1,.e.， 
the pattern Vectors are roWS 0f Xx)， P is a 1-by-W array， 
containing the probabilities of occurrence of each ClasS. 开 f 

P is not included in the argument 1List，the ClasSeSs are 3aSSumet 
to be equal1y Likely， 


the class_ number assigned to the Ith vector in X during Bayes 
classification， 


d=[];%Initialize d. 
error(nargchk(3，4，nargin)) % Verify correct no， of inputs ， 


Ge ooececeoe oo 吧 吕 只 吕 


n = Size(CA，1); % Dimension of patterns . 
% Protect against the possibility that the Cl1asSS number 1IS 
% included as an (n+f)th element of the Yectors， 
X = double(X(:，13:n) ) ; 
W = Size(CA，3); 5 Number of pattern Cl1aSSeS， 
K = size(X，1); % Number of patterns to ClaSSify， 
if nargin == 

P(1:W) = 1/W; % Classes assumed equal1Ly Jikely， 
e1LSe 

if Sum(P) ~= 1 

error('ELements of P must Sum to 1，); 

end 
end 
% Compute the determinantS ， 
for J = 1 : 削 

DM(J) = det(CA(:，:，Jyd))) 
end 


% Compbute inverses，usSing right diviSlon (IM/CA) ，where IM = 
% eye(size(CA，1)) is the n-by-n identity matrix，Reuse CA to 
5 COnServe memory， 
IM = eyef(Size(CAT) ) 
for yd = T: 必 
EAI( 二 汪 过 了 全 人 和 人 人 过 3 过) 
end 
% Evaluate the decision functions. The Sum terms are the 


% Mahalanobis distances discussed in Section 12.2， 
MA = MA'; %5 0rganize the mean Vectors as 『oWS， 


for I = 1 :长 
for J = 13: 必 
由 = MA(J， 2:) 
Y=X--m(ones(sSize(X，1)，1)， 2 
if P(J) == 0 
D(I，J) = ~Inf; 
61Se 
D(I，J) = log(P(J)) - 0.5*10g(DM(J)) ，…， 


-0.5*Sum(Y(I，:)*(CA(:，:，d)*Y(T， 





The output，D，is a column vector of length K， ItS Ith element is 








ehnad 
.end 
end 
“ Find the maximum in each row of D， These maxinma 
% give the Class of each pattern: 


TI，:) == max(D(I，?;))); 
(1) 

end 

% When there are multiple maxima the deciSion 1S 
% arbitrary，Pick the flirst one， 

d=d(:， 1); 


| 


暑 也 ayes recognition is used frequently for automaticajlly classifying regions 如 
multispectral imagery. Figure 12.2 shows the first four images 攻 om Fig. 1.23 
[three visual bands and one infrared band).As a simple illustration, we apPly 
the Bayes classification approach to three types (classes) of Teglions 1n0 these 
images: water, Urban, and vegetation, The Pattern Vectors in thjs exarmpjle are 
formed by the method discussed in Sections 11.5 and 12.3.1, im whlch coOTTe- 
sponding pixejls in the images are organized as VectorIs. We are dealing with 


four images, so the pattern vectors are four dimenslonal. 


To obtain the mean vectors and covariance matrices, we need Sampjles Tep- 
resentative of each pattern class.A simple way to obtain such Samples InteTac- 
fively is to use function roipoly (see Section 5.2.4) with the statement 


>> B = Foipoly(f); 


where f js any of the mujtispectral images and B is a binary mask image, With 
this format, image B is generated interactively on the screen, FiguTIe 12.2(6) 
shows a composite of three masKk Images, B1, 82， and B3, generated Using thils 
method.The numbers 1,2,and3 identify regions containing Samples TePIeSen- 
tative of water, urban development, and vegetation, Tespectivejy、 

Next we obtain the vectors corresponding to each region, The four lmages 


already are registered spatially, so they simply are concatenated along the 


third dimension to obtain an image stack: 
>> Stack = cat(3，f1，f2，f3，f4) ; 


Where f1 thorough f4 are the four images in Figs. 12.2(a) through (d). Any 
point，when viewed through these four images，corresponds to 3 four- 


二、 dimensional pattern vector (See Fig, 11.24). We are ipterested ID the VectorIS 
1， contained in the three regions shown in Fig. 12.2(e), which we obtain by using 
fanction imstack2vectors discussed in Section 11.3: 


>> [X，R] = imstack2vectors(Stack，B) 


了 .where Xis an array whose IOWS are the Vectors， and Ris an array whose IOWS 
上 are the locations (2-D reglon coordinates) corresponding to the Vectors In X. 
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3a hb 
cd 
FIGURE 12.2 
Bayes 
classification of 
mujtispectral 
data. 

(a)-(c) Images in 
the bjue, green， 
and red visible 
wavelengths. 

(d) Infrared 
Image. (e) Mask 
showing Sampie 
reglons of water 
(1 urban 
development (2)， 
and vegetation 
(3). (人 Results of 
”lassification. The 
black dots denote 
points classified 
incorrectly. The 
?ther (white) 
2oints in the 
-eglons Were 
classlfied 
coOrrectly 
Original images 
:OUItesy of 
NASA.) 























Using imstack2vectors with the three masks B1,B2,and B3 yielded three 
Vector Sets, X1, X2, and X3, and three sets of coordinates, R1, R2, and R3. Then 
three Subsets Y1,Y2,and Y3 were extracted from the X'stouse ag tralining Sam- 
ples to estimate the covariance matrices and mean vectors.The Ys were gen- 
erated by skipping every other row of X1, X2, and X3. The covariance matrix 
and mean Vector of the vectors in Y1 were obtained with the command 


>> [CT1，nml] = covmatrix(Y1) ，; 


and Similarly for the other two classes. Then we formed aiTays CA and MA for 
Use in bayesgauss as foljlows: 


YV 
V 
C7? 
思 
上 


cat(3，C1，C2，C3)， 
cat(2，mlf，m2，m3)， 


V 
V 
乏 
万 
上 


Jhe performance of the classifier with the training Patterns was determined by 
人 4 classlfying the training sets: 


>> dY1 = bayesgauss(Y1，CA，MA) ， 


and similarly for the other two classes.The number of misclassified patterns of 
class 1 was obtained by writing 


>> IY1 = find(dY1 -= 1)， 


Pinding the class into which the patterns were misclassified is Straightftorward， 
For instance, Length(find(dYf == 2)) gives the numpber of patterns from 


class ] that were misclassified into class 2.The other pattern sets were handied 
in a Similar manner. 


patterns from the urban area.This js not Unexpected, as Vegetation js present 
there also (note that no Patterns in the urban or vegetation areas were mis- 
:Classified as water). Figure 12.2(f) shows as black dots the potmnts that were 
-Imisclassified and as white dots the points that were classified correctly in each 
2 region. No black dots are readily visible in region 1 because the 7 misclassified 
points are very close to, or on ,the boundary of the white reglion， 

Additional work would be required to design an operable recognition SYS- 
tem for multispectral classification. However the Important Point of this ex- 
ample 1s the ease with which such a System could be Prototyped using 
MATLAB and IPT functions， compilemented by some of the functions devel- 
oped thus far in the book. 肖 
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TABLE 12.1 Bayes classification of multispectral image data. 


Independent Patterns 
No. of Cjassified into Class 0 


Training Patterns 
No. of 《Classified ipto Class 2% 


Class Samples 1 2 3 (Correct| Class 9amples 1 关 ，“ 且 Correct 





ji Adaptive Learning Systemas 

The apPproaches discussed In Sections 12.3.1 and 12.3.3 are based on the use of 
Sample Patterns to estimate the statistical Parameters of each Pattern class. 
The minimum-distance cjassiflier 1s Speclified completely by the mean vector of 
each cjass. Similarly, the Bayes classifier for Gaussian popujlations ls Specified 
completely by the mean vector and covarlance matrIxX of each ciass of patterns. 

In these two apProaches,training ls asimple matter. TIhe training Patterns of 
each cjass are used to compnute the parameters of the declsion functilon corre- 
sponding to that class. After the parameters In quUestion have been estimated， 
the Structure of the classjfier js fixed,and its eventual performance wlll depend 
on how well the actual pattern Populations Satisfy the underilying Statistical asS- 
Sumptions made in the derivation of the classification method being used. 

As long as the pattern ciasses are characterlized, at ljeast approximately by 
Gausslan probabijlity density functions, the methods Just discussed can be 
quite effective. However, when this assumption is not valid, designing a statis- 
tical classifier becomes a much more difficuit task because estimating multi- 
variate prIobability density functions ls not a trivial endeavor. In Practice, Such 
decision-theoretic problemas are best handied by methods that yield the re- 
quired decision functions directly via training. Then making assSumptions Te- 
garding the underlyinag Probability density functions or other Probabilistic 
information about the Pattern cliasses Under consideration 1S UnDeceSSarTY. 

The Principal approach in use today for this type of classification is based on 
neural netwotks (Gonzalez and Woods [2002j).The scope of Implementing neurT- 
al networks suitable for image-processing applications is not beyond the capabi 
ities of the functions available to us in MATLAB and IPT However, this ettort 
would be unwarranted in the present context because a comprehensive neurIal- 
networks toolbox has been avajlable from The MathWorks for several years，. 





:。 Structural Recognition 


Structural recopgnition techniques are based geneIaly on Tepresenting objects of 
interest as strings, trees, Or graphs and then defining descriptors and recognltlon 
rules based on those representations. The Key difference between declslonD- 
theoretic and structural methods is that the former uses quantitative descIIPtOIS 
expressed in the form of numeric vectors. Structural techniques, on the other 
hand, deal principally with symbolic information. For instance, SUPPose that 0b- 











ject botndaries in a given application are Iepresented by minimum-perimeter 


pojygons. A _ decision-theoretic approach might be based on forming vectors 
whose elements are the numeric values of the interior angles of the polygons， 
whlle a structural approach might be based on defining symbols for ramgey of 
angle yalues and then forming a string of such symbols to describe the patterns. 

strings are by far the most common representation used in structural reCOgnl1- 
tion, SO we focus on this approach in this section. As wil become evident shortiy 


MAITLAB has an extensive set of functions specialized for string manipulation. 


2.4. Working with Strings in MATLAB 


MAILAB,astringis aone-dimensional array whose components are the nu- 


meIlc codes for the characters in the string.The characters displayed depend on 
the character set used in encoding a given font, The engtp of a strinpg is the num- 
ber of characters in the string, including spaces. It is obtained using the familiar 


_function length.A stringis defined by enclosing its characters in single quotes (a 


textual quote within a string is indicated by two quotes)， 
Table 12.2 lists the Principal MATLAB fanctions that deal with strings.t 
Consldering first the general category function blanks has the syntax: 


s = blanks(n) 


jlt generates a string consisting of n blanks. Function cellstr creates a cell 
alTay of strings from a character arTay. One of the principal advantages of stor- 
ing strings in cell arrays is that it eliminates the need to pad strings with blanks 
to Create character aITays with rows of equal length (e.g., to perform string 


comparisons). The syntax 


C = Cellstr(S) 


places the rows of the character arTay S into separate cells of c. Function char 
ls used to convert back to a string matrix. For example, consider the strin 
matrix 


>>S= [ abc'; 'defg'; 'hi  '] s%5 Note the blLanks， 


abc 
defg 
hn 这 


二 Typing whos S at the prompt displays the following information: 


>> Whos S 
Nanme S1LZze BytesS CLaSS 
S 3X4 24 char array 


Some of the string functions discussed in this Section were introduced in earjier chapters. 
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TABLE 12.2 
MATLAB'S 


itrIng- 


manipulation 


fanctions. 





Category 


(Ceneral 


String ftests 


String operations 


String to number 


cOnyersionm 


也 ase ntniber 


coOnyYersion 





Function Name 





blLanks 
cellstr 


char 
deb1lank 
eVal 
IISCelI1lstn 
1IScChar 

革 S1ettenr 
IsSpace 
0OWer 
regexp 
Fegexpi 
regexprep 
StrFCcat 
strcmp 
Strcmpi 
Strfind 
Strjust 

S 七 六 matcl 
Strncmp 
strncmpi 
strread 


strrep 
Strtok 
strFvcat 
Upper 
doub1e 
1nt2Str 
mat2str 


num2stnp 
Sprint 了 
str2doub1e 
str2num 
SSCanf 
base2dec 
bin2dec 
dec2base 
dec2bin 
dec2hex 
hex2dec 
hex2nunm 


上 xpjanation 


string of blanks, 
Create cell array of strings from character 
arTray Use function char to convert back toa 
character string. 

Create character array (string). 

Remove tralling blanks. 

人 Execute string with MATLAB expression. 
Jrue for cell array of strings， 

Jirue for character arTay. 

True for letters of the aiphabet. 

True for whitespace characters. 

Convert string to lowercase. 

Match regular expression， 

Match reguiar expression, ignoring case， 
Replace string using regular expression. 
Concatenate strings. 


_ Compare strings (see Section 2.10.5). 


Compare strings, ignoring case, 

Find one string within another. 

Justify string. 

Find matches for string. 

Compare first n characters of strings、 
Compare first n characters， 1gnoring case，. 
Read formatted data from a string, See 
Section 2.10.5 for a detailed explanation。 
Replace a string within another. 

Find token in string. 

Concatenate strings vertically 

Convert string to uppercase， 

Convert string to numeric codes. 

Conyert integer to string. 

Conyvert matrix to a string suitable for 
Processing with the eval function， 

Convert number to string. 

Write formatted data to string. 

Lonvert string to double-precision value， 
Conyvert string to number (see Section 2.10.5). 
Read string under format control， 

Convert base B string to decimal integer. 
Convert binary string to decimal integer. 
Conyert decimal integer to base B StTring. 
Convert decimal integer to binary String， 
Convert decimal integer to hexadecimal string 
Conyert hexadecimal string to decimal integer， 
Coliyert IEEE hexadecimal to double- 
Precision number， 
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Note in the first command line that two of the three Strings in S have trailing 
blanks because all rows in a string matrix must have the same number of char- 
acters. Note also that no quotes enclose the Strings in the output because Sisa 
character 3aTTray The following command returnsa3 x 1 ce] array: 


>> C = Cellstr(s) 


C 一 
”abc， 
efg， 
上 工 ， 
>> Whos 5C 
Name Size Bytes CaSS 
C 3X1 294 Cell array 


Where, for example,c(1) = ' abc'.Notethat quotes appear around the strings 
in the output, and that the strings have no tralling blanks.To convert back toa 
String matrix we let 

















和 2 = charfc) 
和 “一 
abcC 
defg 


hi 



































量 


ia 吕 要 罗 2 
证 0 0 


2 ple, iscellstr(c) would return a 1 and iscellstFr(S) would return a 0 2 


上 





Function eval evaluates a String that contains a MATLAB eXpresSion. The 
call eval(expression) executes expression, a String containing any valid 
MATLAB expression, For eXample, it is the character string t= '3^2， tyPp- 
ing eval(t) returns a 9， 

The next category of functions deals with String tests.A lis returned ifthe 
funtion is true; otherwise the value returned is 0 Thus,in the Preceding exam- 


Wi 
本 


:SCellstr 
0 人 





similar comments apply to the other functions in tbis category. 
String operations are next, Functions lower (and upper) are self explana- 


| tory They are discussed in Section 2.10.3. The next three functions deal with 


/reSuiar expressio1s, which are sets of Symbols and syntactic elements used 
commonly to match patterns of text. A Simple example of the power of regular 
exXPpTIesslons is the use of the familiar wildcard Symbol “*”in a file search. For 
instance, asearch for Imagey#.min a typlical search command window would re- 
turn all the M-files that begin with the word image” Another example of the 
use of regujlar expressions is in a Search-and-replace function that searches for 
an Instance of a given text String and replaces it with another. Regular expres- 
Sions are formed using 7tetacparacters, Some of which are listed in Table 12 3 


和 
直 。 。 。 

Regular expressions can be traced to the work of American mathbematician stephen KKleene, who devel- 
9ped reguiar expressions as a notation for describing what he called "the algebra of regular sets,… 


ER 
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TABLE 1 Metacharacters 

Some of the 

metacharacters Matches any one characteT， 


Matches any one of the characters, (ab,.. 小 contained within 
the brackets. 


used in regular 
eXpressions for 







matching. See the [ 关 a 提 sse] Matches any character eXcept those contained within the 
regular brackets. 
expressions ? Matches any character zero oOT One times. 
help page for a 上 Matches the preceding element Zefo OT ImOIe tlmes. 
comaplete jlst， + Matches the Preceding eljement one oT mmOIe times. 
{num} Matches the preceding element num tlmaes， 
{fmin，max} Matches the preceding element at least min times,but not 
more than max times， 
| Matches either the expression preceding or following the 
metacharacter | . 
^chars Matches when a String begins with chars. 
Chars$ Matches when a string ends with chars. 
\<chars Matches when a word begins with chars. 
chars\> Matches when a word ends with chars. 






\<WOrdA\> Exact word match， 


In the context of this discussion,a“word” 1S a substring within a String， PIeced- 名 


edbyaspace or the beginning of the string,and ending with a space or the end 
of the string. Several exampjles are given in the following paragraph. 
Function regexp matches a Tegular eXPIesSloD， Using the basic SYntaX 


idx = regexp(StFr，eXxpr) 





returns a Tow vector, idx, containing the indices (locations) of the substrings 


in str that match the regujar expression string, expr, 上 or eXample, SUPPOSe 
that expr = 'b.*a'. TIhen the expression idx = regexp(str， expr) would 
mean find matches in string str for any b that is followed by any character (38 
specified by the metacharacteT “2)] any DuUmber of times， including zero times 
(as specified by *),followed by an a. Ihe indices of any locations in str meef- 
ing these conditions are Stored in vector idx, If no such locations 3Ie found， 
then idx is returned as the empty matITIX. 

A few more examples of regular eXPreSSl10nS for expr should clarify these 


concepts.The regular expression "b, + as would be as in the Preceding exXaIm- 


ple,except that“any numbeT of times ,including zero times”would be replaced 
by“one or more times. The expression“b [0-9] means any b followed by 


any number from 0 to 9; the expression 'b [0-9]* means any b followed by 加 


any number from 0 to 9 any number of times': and 'b [0-9] +' Imeans b fol- 
lowed by any number from 0 to 9 one OF mOIe times. For example, if Str = 
,bp01236C234bcd', the preceding three instances of expr would give the fol- 
lowing results: idx = 1i idx= [1 10]; and idXx = 1 


As an example of the use of regular eXpTessions for recognizing object char- 到 


acteristics,suppose that the boundary of an object has been coded with a four 
directional Freeman chain code [see Fig, 11.1(aj], stored in Strlng Str, SO that 




















































>> St 
Str = 
000300333222221111 


Suppose also that we are interested in finding the locations in the String where 
the direction of travel turns from east (0) to south (3), and stays there for at 
least two increments, but no more than Six increments. This 1s a “downward 
step” feature in the object larger than a single transition, whjich may be due to 
noise. We can express these Tequirements in terms of the foljlowing Tegular exX- 
Pressiomn: 


>> expr = '0[3]{2，6} ! 
Then 


>> idx = regexp(Str，expr) 


idXx = 


The value of idx jidentifies the point in this case wherea0 is followed by three 
3s. More complex expressions are formed in a simllar manneT 

Function regexpi behaves in the manner just described for regexp,except 
that it ijgnores character (upper and lower) case， Eunction regexprep，with 
SYntaX 


s = regexprep(Sstr，expr，rep1lace 1) 


replaces with string replace all occUIITences of the regular exXpression expr In 
string, str, The new string 1 returned. 过 no matches are found regexprep re- 
turns str, unchanged. 

Function strcat has the SyDntaxX 


CCSRES SS25 .39 


This function concatenates (horizontajly) corresponding rows of the character 
arrays S1, S2, S3, and so on,. All input arrays Inust have the Same number of 
TOws (or any can be a single String). When the InpPuts are ali character aITayS， 
the output is also a character array. 下 any of the inputs is a cell arITay of 
Strings, strcat Ieturns a cell array of strings formed by concatenating COTTe- 
Sponding elements of S1, S2, S3, and So on. TIhe inputs Inust all have the Samie 
， Size (or any can be a scalar). Any of the inputs can also be character aifTayS. 
-railing spaces in character arITay inputs are ignored and do not appear in the 
output. This is not true for inputs that are cell arrays of Strings. IO PITeSeTVe 
trailing spaces the familiar concatenatlon SYntaX based on square brackets， 
【[S1 S2 S3 .， .],should be used.For example， 
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>> a = 'hello ' 5 Note the trailing blank Space， where S and T can be cel arrays of strings, returns an arIay R the Same size as 

>> b = 'goodbye， Sand T contalining l for those elements of S andT that match (up to n charac- 

>> Strcat(a，b) tersj, and 0 otherwise. S and T must be the same size (or one can be a scajar 

ans = cei 山 . Elither one can also be a character arTay With the correct number of rows. 

hel1ogoodbye The command strncmp is case sensitive,Any leading and trailing blanks in ei- 有 

[a bj] ther of the strings are included 记 the comparison. Function strncmpi per- .stkncmpi 
ans 三 forms the same Operation as strncmp, but ignores character case. 0 


hello goodbye 
Function strvcat, with SyntaxX 
S = Strvcat(t1，t2，t3，.，，,) 


forms the character array S _ containing the text strings (oOI String Iatrices) 
t1,t2,t3，..，as IOwWS. Blanks are appended to each string as necesSaTy to 
form a valid matrix. Empty arguments are ignored. For example，uslng the 
strings a and b in the previous exarmple， 


>> Strvcat(a，b) 
ans = 


helJlo0 
goodbye 


Function strcmp, with SyDtax 


k = StPFcmpl(sStrT，Str2 ) 
compares the two strings in the argument and returns 1 (true) 下 the StTings 
are identical. Otherwise it returns a0 (false).A more general SyntaX 1S 


K = strcmp(S，T) 


where either SorTis a cell array of strings, and Kis an arTay (of the same SIZe 
as S andT) containing 1s for the elements of S andTthat match, and 0s for the 
ones that do not.SandT must be of the same size (or one can be a scalar cej)， 
Either one can also be a character arTay with the proper humber of TOWsS， 
Function strcmpi performs the same operation as strcmp,but it ignorTes char- 
acter Case. 
Function strncmp, with Syntax 
k = Strncmp(' strf ， Str2 ，n) 

returns a logical true (1) 过 the first n characters of the strings str1 and str< 
are the same, and returns a logical false (0) otherwise. Arguments str1 and 
str2 can be cell arrays of strings also. Ihe SyntaxX 


R = strncmp(S，T，Pmn) 





Eunction strfind, with syntax 
TI = strfind(str，pattern) 


Searches String str for occurrences of a sporter string, pattern， Teturning the 
starting Index of each such occurrence in the double array, TI.If pattern is not 
found in str, or 于 pattern is longer than str, then strfind returns the 
empty array, [ ]. 

Function strjust has the syntax 


Q = Strjust(A，direction) 


where Ails a character arTay, and direction can have the justification values 
right ，jleft ,and ' center ' .The default justification is :right' The out- 
Put array contains the same Strings as A, bnit justified in the direction specified， 
Note that justification of a string implies the existence of leading and/or trail- 
ling blank characters to provide space for the specified operation, For instance， 
jetting the Symbol “ 口 " represents a blank character, the string ' 口 口 abc' with 
two jeading blank characters does not change under 'right' justification; be- 
comes abc 中 DO with 1eft' justification; and becomes the String ' 口 abc 口 ' 
with “ center， justification，Clearly these operations have no _ effect on a 


| string that does not contain any leading or trailing blanks, 





局 


”Teturns only the indices of the strings in STRS matching str exXactly. For exam- 
-ple,the statement 


区 >> 川 = 


Function strmatch, with syntax 


i = Strmatch( str' ，STRS) 


looks through the rows of the character array or cell arTay of Strings, STRS, to 
-find strings that begin with string str,returning the matching row indices.The 
”aljternate Syntax 


mm = Strmatch( Str ，STRS， 'exact ') 


Strmatch( max ，Strvcat( max'，'minimax ' ， maxImum' ) ) 


Ieturnsm= [1; 3] becauserows 1and3ofthe array formed by strvcat beglin 
with 'max' .On the other hand,the statement 


站 
到 


区 二 让 全 
沸 2 和 
ee 全 
caiS 二 FfInd 
人 
SA 私 . 


oa 
六 人 
人 
ae 二 
0 二 情 US t 
六 站 


人 


的 
We 


-Strmatch 
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SF 


>> 由 = Strmatch( max'，Sstrvcat( nax ， mininmax ， maximunm )， exact ' ); 


returnsm = 1,because only row 1 matches max exactly、 
Function strrep, with Syntax 


= Strrep( str1' ， str2 ， Str3 ) 


replaces all occurrences of the string str2 within String str1 with the string 
str3. 玉 any of str1,str2, or str3 is a cell arTay of Strings, thls function re- 
turns a cell array the Same size as Str1, str2,and str3,obtalined by perftorm- 
ing a strrep using corresponding elements of the Inputs. TIhe inputs must al] 
be of the same Size (or any can be a scajlar ce 由 .Any one of the strings can also 
be a character array with the correct nuImber of rows. 上 or exampjle， 


>> S = Image processSing and restoration，; 
>> Str = strrep(S， processSling ， enhancement |) 
Str = 


Image _ enhancement and restoration . 
Function Strtok, with Syntax 
t = Strtok(' str' ，delLim) 


retuIns the first token in the text String Str, that 1$, the first set of characters 
befere a delimiter in delim is encountered. Parameter delimls a vector Con- 
taining delimiters (e.g.,blanks, other characters, Strings). For example， 


>> Str = 'An image is an ordered Set Of pixels ) 
>> delinmn = [ Xx 】; 
>> 七 = Stprtok(StPr，delinm) 


本 一 
An 


Note that function strtok terminates after the first delimiter is encountered， 
(i.e.,a blank character in the example just given)j.If we change deljimto delin 
= ['x'j],thenthe output becomes 


>> 七 = strtok(Str，delinmn) 
七 三 
An image is an ordered Set of pi 


The next set of functions in Table 12.2 deals with converslions petwe6n 
strings and numbers. Function int2str, with SYntaxX 


stFr = Int2Sstr(N) 
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converts an integer to a string with integer format. The input N can be a single 
integeTr or avector or matrIx of integers. Noninteger inpnuts are rounded before 
Converslon. For example, int2str(2+3) isthestring '5' .For matrix Or vec- 
tor inputs, int2str Teturns a string matTixX: 


>> StP = 1Int2sStr(eye(3)) 


ans = 
1 0 0 
OO 1 0 
DO DO 1 
>> ClLassf(str) 


ans = 
Char 


Function mat2str, with Syntax 
Str = mat2strA) 


conyerts matrIx A imto a string, Sulitable for input to the eval function, using 
full preclslon. USsing the Syntax 


stPF = mat2strf(A，TDn) 


converts matrIX A Using n digits of precision, For example, consider the matITix 


>> A= [12;3 4] 
六 = 

1 2 

3 4 


The statement 


>>b = mat2str(A) 


produces 


[112 


where b is a String of 9 characters, including the Square brTackets, Spaces, and a 


4 semicolon.The command 
和 >> evalL(mat2str(A) ) 


”reproduces A.The other functions in this category have similar interpretations， 
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The last category In Tabjle 12.2 deals with base number conversions. For exy- 
ambpjle,function dec2base, with syntax 


StP = gdec2basel(d，base) 





converts the decimal integer d to the Specified base, where d mnust be a non- 
negative integer Smaller than 2^52, and base must be an integer between 2 
and 36. JIhe returned argument stris astring.For exampble,the following com- 
mand converts 2310 to base 2 and returns the result as a String; 


>> Str = dec2base(23，2) 
str = 
10111 
>> CLasSsfStr) 
ans = 
Chan 


Using the Syntax 


Str = dec2base(d，base，PTn) 


produces a representation with at least n digits. 


12.4.2 String Matching 


Jn addition to the string matching and comparing functions in Table 12.2, it is 
often Useful to have available measures of similarity that behave much like the 
distance measures discussed in Section 12.2. We ilustrate this approach using 
a Imeasure defined as follows. 

9UPpose that two region boundaries，4 and pb)，are coded into strings 
2102...0m and Dipb2 ...pDi,ITespectively. Let aw denote the number of matches 
between these two strings, where a miatch is said to occur in the Kth position 让 
kk 三 DTIhe number of symbols that do not maatch is 


8 = max(|d|,|2|l) -aa 


where jargl is the length (number of symbols) of the string in the argument, 
can be Shown that 8 = 0iLand only ifiaandpb areidentical strings. 
Asimple measure of similarity betweena and pp is the ratio 


R = 二 = 一 一 一 一 一 一 

B max(lal,jo) -ac 

This measure，proposed by Sze and Yang [1981], is infinite for a Perfecf 
match and 0 when none of the corresponding Symbols in a and pp match (a 18 
0 in thils case)， 





二 
2 
时 只 以 昌 
1 
四 和 
end 
了 
光 








清 i. 
党 
司 
可 
人 
下-; 
所 用 
0 
和 
站 隐 本 
全 
[Da 
去 
章 人 
了 
本 
下 
0 
2 


者 


人 
Ce 
站 
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Because matching is performed between cOITESPponding Symbols, it is re- 
qulred that all strings be“Tregistered”in Some position-independent manner in 
order for this method to make sense. One way to reglister two strings is to Shift 
one String with respect to the other until a maximum value of R is obtained 
This and other similar matching strategies can be developed using some of the 
stDng operations detailed in Table 12.2.Typically a more efficient approach 1S 
to define the same starting point for al Strings based on normajlizing the 
boundaries with respect to size and orientation before their String representa- 
ton 1S extracted. This approach is iustrated ip Example 12.3. 

Jhe folowing M-function computes the preceding measure of siimilarity for 
two character strings. 


stFsimiLlarity 
rr 


function R = Strsimilarity(a，b) 

5%STRSIMILARITY Computes a_ Similarity measure between two Strings， 

R = STRSIMILARITY(A，B) computes the Slimilarity measure，R， 
defined in Section 12.4.2 for strings A and 8，The strings do not 
have to be of the Same length，but if one is shorter than Other ， 
then it isS asSSumed that the shorter String has been padded With 
jleading bjlanks So that it is brought into the neceSSsary 
registration prior to using this function， only one of the 
strings can have blanks，and these must be Jeading andy/or 
trailJing blanks. Blanks are not counted When Computing the length 
of the strings for Use in the similarity measure . 


有 及 吕 昌 电 上 de op 


% Verify that a and b are character strings ， 
if ~ischar(a) | -~ischar(b) 
error( Inputs must be character strings。' ) 


| 5 Find any blank spaces . 
1 I=find(a 
。，J = find(b 


1 
= 
LI = length(I); LJ = Length(J)， 
ifLI-=0&LJU-=0 
errort Only one of the strings can contain blanks.') 


end 
% Pad the end of the appropriate string，TIt is assumed 
% that they are registered in terms of their beginning 
5 positions， 
a = (DO 
La = length(a); Lb = Jength(b ) ; 
if LI==0&LJ==0 
if La > 1Lb 
b = [b; blanks(La - Lb) ']， 
elLse 
a = [ai blanks(Lb - La) 1]; 
end 


:2 elseif isempty(I) 


Lb = length(b) ~ Length(d)， 
b = [by blanks(La ~ Lb - LU) ]; 


?10 
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JDbject 
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el1Se 

La = length(a) - Length(I) ; 

a= fa; blanks(Lb 一 La 一 LI) ]; 
end 


%5 Compute the Similarity measure ， 
I =find(a ==b) 

alpha = Length(I) ; 

den = max{La，Lb) - alphai 


if den == 
R = Inf; 
e]1Se 
BR = alpha/den; 
end ee 


蔷 Figures 12.3(a) and (d) show silhouettes of two samples of contaliner bot- 
ties whose Principal shape difference is the curvature of their sldes. For pur- 
poses of differentiation，objects with the cuUrvature characterlstics of 
Fig. 12.3(a) are Said to be from class 1.Objects with straight Sldes are said to be 
from class 2. The images are of Size 372 X 288 pixels. 

To iustrate the effectiveness of measure 尺 for differentiating between ob- 
jects of classes 1 and 2, the boundaries of the objects were approxlmated by 
minimum-perimeter polygons using function minperpoly (see Section | 过 2 
with a cejl size of 8. Figures 12.3(b) and (e) show the results, Then nolse wa8 
added to the _ coordinates of each vertex of the polygons using function 
randvertex (the listing is included in Appendix C), whjich has the Syntax 

[xn，yn] = randvertex(X，yY，npix) 
where x and y are column vectors containing the coordinates of the Vertices of 
a polygon, xn and yn are the corresponding noisy coordinates， and npix is the 
maximum number of pixels by which a coordinate 1 allowed to be displjaced in 
either direction. Five sets of noisy vertices were generated for each class using 
npix = 5.Figures 12.3(c) and (f) show typical results. 

Strings of symbols were generated for each class by coding tphe interior angles 
of the polygons using function polyangles (See Appendix C for the code listing): 


FIGURE 12.3 (a) An object. (b) Its minimum Perimeter polygon obtained using function minperpoly with a 
cell size of 8. (c) Atypical noisy boundary. (d)-(f The same Sequence for another object， 









>> angles = polyang1les(X，YyY) 
所 to the interior angle of that Vertex. This automatically Tegisters the strings (过 
。 the objects are not rotated) because they ajll start at the top, jeft Vertex in al 
人 images.The direction of the vertices output by minperpoly is clockwilse, so the 
| elements of s also are in that direction. Finally each s Was coOnverted from 3 
| string of integers to a character string Using the command 


Then a string, s, was generated from a given ang1les arTay by quantizing the 
angles into 4$?" increments, Using the statement 


>> S = fLoor(angleS/145) + 十; 





This yielded a string whose elements were nuUmbers between ] and 8, with 1 
designating the range 0? 三 8 < 45?", 2 designating the range 42 三 < 90 ， 
and so forth, where 9 denotes an interior angle. 

Because the first vertex in the output of minperpoly is always the toPp， left 
vertex of the boundary of the input, B,the first element of string S corresponds 


.>> s = Int2StPFl(s) 


In this example the objects are of comparable Size and they are all vertlcaj， 
六 80 normalization of neither size noOf orientation Was required. If the objects 
2 had been of arbitrary size and orientation, we could have aligned them along 
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rewieW 
ectlon Al of thls appendix contains a listing of all the fbnctions in the Image Processing Toolbox， 
nd ajl the new functions developed in the preceding chapters. The latter functions are referred to as 
DPCUM functions,aterm derived from the first letter ofthe words in the title ofthe book.Section A.2 
Tsts the MAILAB functions used throughout the book.All page numbers listed refer to pages in the 


Ook ,indicating where a function is first used and iliustrated. In some instances, more than one ljoca- … 


don is given,indicating that the function is explained in different ways, depending on the application. 

Some IPT functions were not used in our discussions. These are identified by a reference to onjline 

elp Instead of a page number.All MATLAB functions listed in Section A.2 are used in the book. 
:ach page number in that section ldentifies the first use of the MATLAB function indicated. 


尘 全 IPTand DIPUM Functions 


he following functions are loosely grouped in categories similar to those found in IPT documenta- 
tion. A new category (e.g., wavelets) was created in cases where there are no existing IPT functions. 





Eunction Category Page or Other 
and Nanmie bescription Location 
Image Display 
colorbar Dilsplay colorbar (MATLAB). onjline 
getimage Get image data from axXes. online 
ice (DIPUM) JInteractive coior editing. 218 
image Create and display Image object (MATLAB). online 
imageSsCc scale data and display as image (MATLAB)， online 
immovie Make movle from mujitiframe image. onjine 
imshow Display image. 10 
imview Display image in Image Viewer. online 
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montage 

movie 

rgbcube (了 DIPUMD) 
Subimage 
trueslize 

Warp 


Image file IO 


二 iLICominfo 
dicomread 
dicomwrite 
dicom-dict ,十 xt 
dicomuid 
Imfinfo 

Imread 

1mWPIite 


Jmage arithmetic 


imabsSsdiff 
imadd 
imcomp1lLement 
Imdizvide 
imlincomb 
ImmulLtiply 
Imsubtract 


Ceonmiefric transformatftions 


checCkerboard 
findbounds 
fl1iptform 
imcrop 
Imresize 
ImpPotate 
imtransfornm 
LIntline 


makeresamp1ler 
maketfornm 
pixeldup(DIPUM) 
tformarray 

tformfwd 

tforminv 
vistformfwd (DIPUM) 


Image registration 


Cpstruct2palrs 
Cpbp2tform 
Cpcorr 
CPSelect 
nornmxcorr2 


人 .1 强 IPT and DIPUM BRunctions 


Pisplay multiple inaage frames as rectangular montage. 


Play recorded movie frames (MATLABI). 
Dispiay a color RGB cube. 

Display multiple images in single figure. 
Adiust display Size of image. 

Display image as texture-mapped surface, 


Read metadata from a DICOM messapge. 

Read a DICOM imasge. 

Write a DICOM image. 

Text file containing DICOM data dictionary 
Generate DICOM unique identifier 

Return information about imasge fiie (MATLAB)， 
Read Image file (MATLAB)， 

Write Image fle (MATLAB)， 


Compnute absolute difference of two images. 

Add two images,or add constant to imasge. 
Complement image. 

Divide two images, or divide image by constant. 
Compnute linear combination of images. 

Multply two images, or multiply image by constant. 
subtract two images, or subtract constant from image. 


Create checkerboard image， 

Find output bounds for geometric transformation. 
Flp the input and output roles of aTFORM struct， 
Crop Image. 

Resize Image， 

有 Rotate Image. 

Apply geometric transformation to image， 
Integer-coordinate line drawing algorithm. 
(Undocumented IPT function). 

Create resampler structure. 


Create geometric transformation structure (ITFORMD) 


Duplicate pixels of an imapge in both directions. 
Apply geometric transformation to N-D array 
Appiy forward geometric transformation. 
Apply inverse geometric transformation. 
Visuajlize forward geometric transformation， 


Convert CPSTRUCT to valid pairs of control points. 


Infer geometric transformation from control point pairs. 


June control point locations using cross-correlation. 
Control point Selection tool. 
Normalized two-dimensional cross-correlation. 


online 
online 

193 
online 
Online 
onjine 


oniline 
onjine 
online 
onjlline 
onjline 
19 
14 
18 


42 

42 
42,.07 
42 

42, 139 
42 

42 


167 
Online 
onjline 
onjline 
online 

本 /2 

188 

43 


190 
183 
168 
online 
184 
184 
1835 


online 
191 
onjine 
193 
Online 
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Pixel vajues and statistics strsimilarity (DIPUM) 。 Similarity measure between two Strings. 309 
corr2 Compnute 2-D correlation coefficient. online x2majoraxis (DIPUMI) Align coordinate x with the major axis of a reglon. 457 
covmatrix (了 DIPUM) Compnute the covarlance matrix of a Vector population. 476 Image Compression 
imcontour Create contour plot of Image data. online 
下 Display histogram of image data. 77 compare (DIPUMI) Compnute and displiay the eIITOT between two matrices. 283 
| Determine pixel color values entropy (DIPUMI) Compute a first-order esttmate of 由 e entropy of a matTix， “58 
improfile Compute pixel-value cross-sections along line Segments. onjine 人 ee 8 0 CE 0 
和 Compute mean of matrix elements 75 huffman (DIPUMI) Build a variable-length Huftman code for Symbol SOUTCe. 290 
PixVal Display information about image PIXels， 17 2 和 光 人 ee 人 
二 Measure properties of image regions. 463 im2 】 peg2k (DIPUMD) Compress an image using 3 J FEG 2000 approximation， 2 
statmoments (DIPUMO) Compute statistical central moments of an image Philstogram. 1>5 人 人 人 9 2 
Std2 Compute standard deviation of matrix eleIments. 415 8 人 二 2 
jpeg2k2im(DIPUMI) Decode an IM2JPEG2K compressed Image. 330 
JImage analysis (includes segmentation, description, and recognition) Lpc2mat (DIPUMI) Decompress a 1-D lossless predictive encoded matIlix， 3 
bayesgauss (DIPUM) Bayes classifier for Gaussian patterns, 493 mat2nhuff (DIEUM) Hufftman encodes a matrlX. 228 
bound2eight (DIPUM) Convert 4-connected boundary to 8-connected boundary. 434 mat21pc (DIPUMD) Compress a matrix using 1-D lossjess predictive coding. 了 2 
bound2four (DIPUM) Convert 8-connected boundary to 4-connected boundary 434 quantize (DIEPUM) Quantize the elements of a UINTS matrIX， 310 
bwboundaries Trace Iegion boundaries. online Image enhancement 
bwtraceboundary Trace single boundarYy. online ， 
bound2im (DIPUMI) Conyvert a boundary to an Image. 435 adapthisteg Adaptlve hastogram equalization. onjine 
boundaiies (DIEPUN1 Tiace region boundaries. 434 decorrstretch Apply decorrelation stretch to multichannej 1mage. onjline 
bsubsamp (了 DIPUMI) Subsample a boundary. 435 gscale (DIPUM) Scajie the intensilty of the input image 76 
colorgrad (了 DIPUMI) Compnute the vector gradient of an RGB image. 234 8 9 2 Distogram 9 2 
colofseg(DIPUM) Segment a color image， 238 int rans (DIPUMI) Berform intensity transformations， 73 
connectpoly (DIPUMO) Connect vertices of a polygon， 433 St It 人 人 0 2 
diameter (DIPUMD) Measure diameter of image regions， 456 StretcChlLinm Find limits to contrast stretch an Image. online 
edge Find edges in an intensity image， 383 bnage noise 
fchcode (DIPUMI) Compute the Freeman chalin code of a boundary. 437 二 Add noise to an image. 106 
frdescp (DIEUMI) Compute Fourier descriptors imnoise2 (DIPUM) Generate an array of random numbers with specified PDF 148 
we et ee image threshold using Otsu's method. imnoise3 (DIPUM) Generate periodic noise, 152 
ough transform. 
houghlines (DIPUMD) Extract line Segments based on the Hough transform， 401 Linear and nonlinear spatial filtering 
houghpeaks (DIEPUMI) Petect peaks in Hough transform， 399 2 adpmedian (DIPUM) Perform adaptive median filtering. 165 
houghpixels (DIPUMI) Compute image pixels belonging to Hough transform bin. 401 扩 convmtx2 Compnute 2-D conyolution matrix. online 
ifrdescp (DIEPUM) Compnute inyerse Fourier descriptors. 457 1 dftcorr (DIPUM) Perform frequency domain correlation， 491 
imstack2vectors (DIPUM) Extract vectors from an 1mage stack， 470 四 dftfilt (DIPUM) Perform frequency domain fltering. 122 
invmoments (DIPUMI) Compute inyariant moments of image. 472 上 fspecial Create predefined filters. 99 
mahalanobis (DIPUM) Compnute the Mahalanobls distance. 487 加 medfilt2 Perform 2-D median filtering. 106 
minperpoly (DIPUMI) Compute minimurm perlimeter polygon. 447 1: imfilter Filter 2-D and N-D images. 92 
polyangles (DIPUMN Compute internal polygon angjles. 510 大。 ordfilt2 Perform 2-D order-statistic filtering. 105 
princomp (DIPUM) Obtain principal-component vectors and related quanitities. 二 77 把 spfilt (DIPUMD) Performs jinear and nonlinear spatial filtering. 1S9 
qtdecomp Perform quadtree decomposition. 413 | wiener2 Pertorm 2-D adaptive noise-Temoval filtering. onjine 
qtgetblk Get block values in quadtree decomposition，. 413 
qtsetb1Lk Set block values in quadtree decomposition， onjine Linear 汪 D 人 lter design 
randvertex (DIPUM) Randomly disPlace Polygon verticesS. 5S10 | freqspace Determine 2-D frequency response spacing (MATLAB)， onlimne 
regiongrow (DIPUMO) Perform segmentation by region grIOwIng. 409 忆 freqz2 Compute 2-D frequency reSponse. 2 3 
signature (DIPUM) Compute the signature of a boundary. 450 1 fsamp2 Design 2-D FIR filter using frequency Sampling. online 
specxture (DIPUM) Compute Spectral texture of an Image. 469 1 ftrans2 Design 2-D FIR 人 lter using frequency transformation， Online 
splitmerge (DIPUM) Segment an image using a Split-and-merge algorithm. 414 上 fwind1 Design 2-D FIR fiiter using 1-D window method. onjline 
statxture (DIPUMI) Compute statistical measures of texture in an image. 467 fwind2 Design 2-D FIR filter using 2-D window method， Online 
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imclose Close image. 348 


imclearborgder Suppress light Structures connected to image border， 300 


hpfilter (DIPUM) Computes frequency domain ee 1 Diiaieiniase 340 
1pfilter (DIPUM ) Compnutes frequency domain ljowpass filters. 131 ES 和 347 
Image debiurring (restorationm) imextendedmax Extended-maxima transform. onjline 
deconvblLind Debiur Image using blind deconvolution. 180 imextend edmin Extended-minima transform. online 
deconv1Lucy Deblur image using Lucy-Richardson method. 7 imfITI 上 i 人 and holes. 366 
deconvreg Deblur image using regularized filter， 175 imhmax 于 -maxima transform online 
deconywnr PDeblur image using Wiener filter. 171 imnnan 了 -IInima transform， 3 /人 
edgetaper Taper edges using point-spread function. 172 imimposemin Impose minima. 424 
otf2psf Optical transfer function to point-Spread function， 142 imopen Open 区 348 
psf2otf Point-sSpread function to optical transfer function. 142 imreconstruct Morphological Ieconstructlon， 303 
imregionalmax Reglicnal maxima. onlne 
Image transforms imregionalmin Regional minima. 422 
dct2 2-D discrete cosine transform. 321 imtophat Perform tophat filtering， 373 
dctmtx Discrete cosine transform matriX. 3 Watershed Watershed transform . 420 
- rojections to Parallel-beam. online 
0 onjine Morphological operations (binary images) 
生生 2 2-D fast Fourier transform (MATLAB). | 过 app1LylUt Ferform neighborhood operations using lookup tables. 3 
fftn N-D fast Fourier transform (MATITLAB). onjline bwarea Compute area of objects in binary image. onjline 
下 下 七 S 站 了 竺 二 Reverse quadrants of output of FFT (MATLAB). 112 bwareaopen Binary area open (remoye smail objects)， online 
idct2 2-D Inverse discrete cosine transform， onjine bwdist Compnute distance transform of binary image. 418 
ifanbeanm Compnute Inverse fan-beam transform. online bweu1lenr Compute Euler number of binary image. onjline 
ft2 2-D inverse fast Fourier transform (MATLAB). 114 bwhitmiss Binary hit-miss Operation， 332 
工 和 和 捷 站 N-D Inverse fast Fourier transform (MATLAB). onjiine bwlabel Label connected components in 2-D binary Image. 361 
iradon Compnute inverse Radon transform. online bwLabeln Label connected components in N-D binary image. onjline 
para2fan Convert parajlel-beam projections to fan-beam， onjine bwmorph Ferforrma morphological operations on binary imaage， 3356 
phantom Wenerate a head phantom image. online bwpack Pack binary image. online 
radon Compute Radon transform. onjine bwperinm Determine perimeter of objects in binary image， 445 
bwselect Select objects in binary image. onjline 
ee bwulterode Ultimate erosion. onjline 
wave2gray (DIPUMI) Display wavelet decomposition coefficients. 267 bwunpack Unpack binary image. 古人 
waveback (DIPUM) Perform a multi-level 2-dimensional Inverse FWT 了 2 endpoints (DIPUMD) Compute end points of a binary image. 354 
wavecopy (DIPUMI) Fetch coefficilents of wavejlet decomposition structure. 205 ae Construct lookup table for use with applylut 353 
wavecut (DIPUM) Set to zero coefficients in a wavelet decommposition structure， 204 
wavefast (DIPUMD) Perform a mnultilevel 2-dimensional fast wavelet transform. 255 | ， Structuring element (STREL) creation and manipulation 
wavefilter (DIPUM) wreate wavejet decomposition and reconstruction filters. 2 丰 。 getheight Get strel height. online 
wavepbaste (DIPUMD) Put coefficlients in a wavelet decomposition structure. 2065 ， getneighbors Get offset location and height of strel neighbors， 5 
wavework (DIPUM) 下 dit wavelet decomposition StrUctures. 202 扩 getnhood Get strel neighborhood. onjline 
wavezero (DIPUMID) Set wavejlet detail coefficients to zero， 7 j getsequence Get sequence of decomposed strels 342 
， 2 未 iiSsflat Return true for Hat strejls， onjine 
LO 人 和 全 利和 se . 二 Peflect Reflect strel about its center. online 
bestbik (Choose biock size for block processing，， 六 strel Create morphological structuring element. 341 
blkproc Implement distinct bjock PrIOcessSing for tmage. 2 -RansIate 下 aiSTSERiTe 局 
CoO12inm Rearrange matrix columns lnto blocks. 22 | 
CO1 于 it Columnwise nelghborhood operations. 罗 | Region-based processing 
im2col Rearrange Image blocks into columns， 32 本。 histroi (DIPUM) Compnute the histogram of an ROIin an image. 156 
nlfiltenr Perform general sliding-neighborhood operations. 96 人 poly2mask Convert ROI polygon to mask. 省 
Morphological operations (intensity and binary images) 时 ro:-color Select Teglon cf Iterest， based on color， online 
只 C 和 roifil1l moothjy interpolate within arbitrary region， online 
Conndef Default connectivity， 汪汪 373 | roifilt2 Filter a region of interest. online 
Imbothat SEEIOT QHGOAaL 和 en 区 全 roipoly Select polygonal region of interest. 156 
] 
| 
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Colormap manipulation 


prigGhten 
Cmpernmu 二 e 
Cmuniaque 
Colornmap 
imapprox 
rgbpPplot 


Color Space coniyerTsions 


applycfornm 
hsSv2rgb 

ICccread 
Jab2doub1e 
Jab2ulntte6 
Jab2uilnt8 
akecforn 
ntsCc2rghb 

rgb2hsv 

rgb2ntsc 
PFgb2ycbcnr 
Ycbcr2rgb 
rgb2hsi (DIPUM) 
hsi2rgb (DIPUM) 
Whitepoznt 
xyz2doub1e 
xyZz2uint16 


Array 0perations 


ClrCcshIft 

dftuv {(DIPUMI) 
Padarray 
paddedsize (DIPUM) 
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Brighten or darken colormap (MATLAB) 

Rearrange colors ln colormap. 

Find unique colormap colors and corresponding Image. 
Set or get color lookup table (MATLAB). 
Approximate indexed Image by one With tewer colors. 
Plot RGB colormap components (MATITLAB). 


Apply device-independent color space transformation， 
Conyert HSVYV values to RGB color space (MATLAB)， 
Read ICC color Profle. 

Convert Lasxbx color vajiues to class doub1e. 

Convert L*#asxbk color values to class uint16， 

Convert L*#asxbxf color values to class Uint8. 

Create device-independent color Space transform Structure. 
Convert NTSC values to RGB color space. 

Convert RGB values to HSV color space (MATLAB)， 
Convert RGB values to NTSC color Space， 

Conyvert RGB values to YCBCR color space. 

Convert YCBCR vajues to RGB color Space. 

Convert RGB values to HSI color Space. 

Convert HSILI values to RGB color space， 

Returns XYZ valiues of standard lliuminants. 

Convert 和 Y 之 color values to class do0ub1e. 

Convert XYZ color values to class uint16. 


Shift array circularly (MATLAB)， 

Compute meshgrid arTrays， 

Pad arfray. 

Compnute the minimum required pad Size for use In 上 上 工 s， 


Jmage types ang type conyersions 


ChangeCLasS 
dithenr 
gray21lng 
graySJliCe 
IW2bW 
Imn2doupbJe 
I12]java 
Im2]ava2d 
Im2uznt8 
Im2uUint16 
LInd2gray 
ind2rgdb 
abeL2rgh 
mat2gray 
rgb2gray 
rgb2jind 


Change the class of an image (undocumented IPT function). 


Convert image using dithering. 
Convert intenslty Image to indexed Image. 


Create indexed image from intensity Image by thresholdling. 


Convert image to binary Image by threshoilding. 
Conyvert image arITay to double Preclsion. 
Convert image to java Image (MATITLAB). 
Convert image to Java buffered Image object， 
Convert image array to 8-blt unsigned integers. 
Convert image array to 160-bit unsigned Integers. 
Convert Indexed image to intenslty 1mage， 
Convert indexed image to RGB image (MATILAB)， 
Convert abel matrix to RGB Image， 

Convert imatrix to intensity image. 

Convert RGB image or colormap to grayscale. 
Convert RGB Image to indexed image， 


oniine 
onjline 
onjline 
132 
198 
online 


online 
206 
online 
onhne 
onjne 
online 
onjline 
205 
206 
204 
205 
205 
212 
213 
onjline 
onlne 
onjlimne 


433 
128 

97 
117 


多 

199 
201 
201 

20 

20 
online 
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Miiscellaneous 
conway1laws (DIPUMI) 
manualIhist (DIPUMD) 
twomodegauss (DIPUMD) 
UIntJIU 


Toojbox prefereinces 


Iptgetpref 
iptsetpref 
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Apply Conway'`s genetic laws to a single pixel， 
Generate a 2-mode histogram interactively. 
Generate a 2-mode Gaussian function， 

Compute new array values based on 1ookup table， 


Get value of Image Processing Toolbox preference. 
Set value of Image Processing Toolbox Preference, 


ww MATLAB Functions 


JTJhe following MAILAB functions, listed alphabetically, are used in the book. See the pages indi- 
cated and/or online help for additional details. 





MATITLAB Function 


bar 
bin2decC 
blanks 
bpPreak 


《 


cart2p01 
Cat 

Cell 
ce]1 
CeJlJLdisSp 
celTIfun 
cell1Lp1Lot 
Cel1lstr 
chanr 
CircSshift 
COlLon 
colormap 
Computer 


cont1inue 
Conv2 


Description 


Absolute value and complex magnitude. 
Test to determine 过 all elements are nOnZeTO. 
The most recent anSwerI. 

Test for any nonzeTos. 

Axis scaling and appearance， 


Bar chart， 

Binary to decimal number conversion， 

Astring of bjanks. 

Terminate execution of a for loop or while looPp， 


Transtorm Cartesian coordinates to polar or cylindrical， 
Concatenate arTays. 

Round toward infinity. 

Create cell array. 

Display cell array contentts. 

Appjly afunction to each element in a cell arTay. 
Graphically display the structure of cell arrays. 

Create cell arTay of strings from character aITTay 

Create character arTay (String). 

Shitt array circulariy. 

Colon opPerator. 

set and get the current coljormap. 

Identify information about compauter on which MATILAB 
is running. 

Pass control to the next jiteration of for or whil1e loop， 
Two-dimenslional convyolution， 


3595 

87 

86 
onjline 


Online 
Onlmne 


了 Pages 


2 
46 
48 
46 
78 


77 
300 
499 

49 


451 

193 

114 

忆 92 
293,428 
428 

293 

499 

61, 499 
433 
31,41 
攻取 
48 


49 
257 
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ctranspose 


cuUmsSunmn 


D 


dec2hbase 
dec2bin 
diag 
djiff 

gr 

disp 
doub1e 


falSe 

于 eVval 

于 于 二 2 
fftShift 
fieldnames 
figure 
find 

下 JipdP 
fipup 
f1oonr 
for 

fuUlL1 


get 
getfield 
gl1cbal 
grId 
guIidata 
guide 


H 


helpP 
hist 
histC 
hold on 


Vector and matrix complex transpose. 
(ee transpose for real data.) 
Cumulative Sum， 


Decimal number to base conversion， 
Decimal to binary number conversion， 
Diagonal matrices and diagonals of a matrix. 
Diftterences and approximate derivatives. 
Display directoiy listing. 

Display text or arTay. 

Convert to double precision. 


Edit or create an M-file. 

Find eigenvalues and eigenvectors。 

Terminate for,while, Switch， try,and if statements 
or indicate jast index. 

Floating-point relative accuracy 

DisPlay error messapge. 

Execute a string containing a MATLAB exXxpIeSsSion'. 
Identity matrix. 


Create false array. Shorthand for 10gical(0)， 
Function evaluation. 
Two-dimensional discrete Fourier transform 


Shiftt zero-frequency component of DFT to center of Spectrum， 
Return field names of a structure, of PIoperty names of an object， 


Create a figure graphics object. 

Find indices and values of nonzero elements. 

Flip matrices left-right. 

Fiip matrices up-down. 

Round towards minus infinity 

Repeat a group of statements a fixed number of times. 
Convert Sparse matrix to full matrix. 


Get current axes handle. 

Get object properties. 

Get field of structure array. 

Detfine a global variable. 

Gnd lines for two- and three-dimensional plots. 
Store or Tetrieve application data. 

start the GUILayout Editor. 


Display help for MATLAB functions in Command Window 
Compute and/or display histogram， 

Histogram count. 

Retain the current plot and certain axis properties. 


41 


82 


508 
298 
239 
373 
284 

59 

24 


40 
478 
3J1 


48, 69 


50 
301 
494 


38,410 


415 
112 
112 
284 

18 
147 
472 
472 
114 

49 
390 


78 
2 
S$40 
292 
132 
9 
9328 


39 
150 
2 
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下 

ifft2 
ifftShif 二 
imag 
int16 
inpolygon 


interp1g9 
inv 

1S* 
IsScelLLStr 
isS10gical 


上 
1divide 
1ength 
Jinspace 
10ad 
109 
10g10 
109g2 
10gical 
LI00Kkfor 
LOwWenr 


内 


magic 
mat2Str 
人 fi3X 

fliea&Dn 
median 
mesh 
meshgrid 
mfilenanme 
站 工 站 
miInus 
mlidivide 
Power 
mrdizvide 
外 tzmes 


和 nan or NaN 
nargchk 

| nargzn 
nargout 
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Conditionally execute statements， 
Two-dimensional inverse discrete Fourier transform， 
JInverse FFT shift， 

Imaginary part of a complex number. 
Convert to Signed integer. 

Detect points inside a polygonal region. 
Request user inpat， 

Integer to string conversion. 

Convert fo signed integer. 

Conyert fo Signed integeT. 

Quick 1-D linear interpolation. 

Compute matrix inverse. 

See Table 2.9. 

Determine 这 item is a cell array of strings. 
Petermine ifiitem ls a logical array. 


Array Left division, (See mldivide for matrix left division .) 
Length of vector， 

Generate jinearly spaced vectors. 

工 oad workspace variables from disk. 

Natural logarithim. 

Base 10 Iogarithm. 

Base 2 logarithm. 

Conyert numeric vajlues to logical. 

9earch for specified keyword in all help entries, 

Convert String to lower case， 


GOenerate masgic Square. 

Convert a matrix into a string. 

Maxlimum element of an array， 

Average or mean value of arrays. 

Median value of arrays. 

Mesh plot， 

Generate X andY matrices for three-dimensional plots. 
The name of the currently running M-file. 

Minimum element of an array. 

Array and matrix Subtraction. 

Matrix left division. (See ldivide for array left divlsion.) 
Matnx power. (See function power for array power,) 
Matrix right division, (See rdivide for array right division.) 
Matnx multiplication. (See times for array multiplicationy， 


Not-a-number. 

Check number of input arguments, 
Number of input function arguments. 
Number of output function areuments. 


49 
]4 
14 
T13 

24 
440 

600 
500 

24 

24 
下 
403 

48 

48, 9501 

235 


41 
1 
2 
309 
68 
68 
68 
2 
40 
02 


38 
2307 
42 
362 
103 
132 
2 
3 
42 
41 
41 
41 
41 
41 


48 
7 
7 
7 
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ndinms 
neXxt+tpow2 
norm 
nume】] 


ones 


patch 
pernmute 


perSsistent 


PTI 

PLot 
plus 
poL2cart 
POW2 
Powenr 
PPIPnt 
prod 


R 


Fand 
Pandn 
rdivide 
Feal 

放 8&dILPnaxX 
Pea]lIn 
Pregexp 
Fegexp 
Pregexprep 
Fehn 
Fepmat 
PeShape 
PetuPn 
rot90 
rounag 


9 


SavVe 
Set 
Setfielg 
Shading 


Sign 
SiIngJle 
S1Ze 
SoOrt 
Sortrows 


Number of array dimensions. 
Next Power of two. 

Yector and matrix norm， 
Numpber of elements in an arTay. 


Generate array of ones. 


Create patch graphics object. 

Rearrange the dimensions of a multidimensional array. 
Define persistent variabjle. 

Ratio of a circle's circumference to its diameter， 
Linear 2-D Plot. 

Array and matrix addition. 

Transform polar or cylindrical coordinates to Cartesian. 
Base 2 Power and scaje floating-point numbers， 

AITay Power. (See mpower for matrix Power-.) 

Print to fue or to hardcopy device， 

Product of array elements. 


Uniformjy distributed random numbers and arrays， 

Normally distributed random numbers and arTays. 

Array ITight division. (See mrdivide for matrix right division.) 
Real part of complex number. 

Largest foating-point number that your compnuter can represent. 


smailest floating-point number that your computer can TepIesent， 


Miatch regular expTesslon. 

Match regular expression, jignoring case. 
Replace String using regular exXpression. 
Remainder after divislon， 

Replicate and tile an arTay 

Reshape arTay. 

Return to the invoking function. 

Rotate matrix multiples of 90 depgrees. 
Round to nearest integer， 


Save WOIKspace variabies to disk. 

%et object Properties. 

Set fiejd of Structure arTay. 

Set color shading properties. We use the interp mode 
ln the book， 

Signum function， 

Convert to Single Precislon， 

Return array dimenslons. 

Sort elements in ascending order， 

Sort Tows in ascending order. 


117 
485 
3 


38 


196 
486 
353 
48 
80 
41 
451 
300 
41 
2 
98 


38, 145 
38, 147 
41 

中 5 

48 

48 

502 
303 
303 
2530 
264 
300 

49 

94 
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Sparse 
Spline 
SPFintTf 
S 起 6 前 
Strx 
Str2nu 由 
StrCcat 
strcmp 
strcmpi 
strfind 
strjust 
strmatch 
strFncmp 
strncmpi 
strread 
strrep 
StrtoOK 
Strvcat 
Subp1lot 
Su 外 

SUrf 
SWitch 


七 BXt 
tic，toc 

十 ImesS 

二 it】e 
transpose 
true 
try,.,,,Catch 


U 


ULIControl 
Unt16 
UzInt32 
uUinta8 
ULIFeSuUme 
ULWaiit 
uUmznus 
UpP1LUS 
Unidue 
Upper 


V 


varargin 
Vararout 
VerSsion 
VIeW 
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Create Sparse InatTiX， 395 
Cublc spline data interpojlation. 218 
Write formatted data to a String. 2 
Plot discrete sequence data， 79 
String OperTations. See Table 12.2. 500 
String to number conveTSsion， 00 
string conicatenation， 303 
Compare strings. 02, 504 
Lompare Strings 1gnoTing case， 504 
Find one String within another. 505 
Justify a character arTaYy 205 
Find posslble matches for a String. 305 
Compare the firstn characters of two Strings. 504 
Lompare first n characters of strings jgnoring case. 310, 505 
Read formatted data from a stTring. 多 
stIing Search and repPlace. 300 
First token in String. 500 
Yertical concatenation of strings. 304 
Subdivide figure window into afrray of axes Or SuUbplots. 249 
Sum of array elements. 335 
3-D Shaded surface pilot. 134 
Switch among Several cases based on expression. 49 
Create text object. 79 
Stopwatch timerI. 537 
AITray maultiplication. (See mtimes for matrix mujtiplication.) 41 
Addtitle to current graphic. 79 
Matrlix or vector transpose, (See ctranspose for complex data.) 30, 41 
(Create true array. Shorthand for 10gicalL1(1+)， 38, 410 
See Iable 2.11， 49 
Create User interface control object. 534 
Convert to unsigned integer. 24 
Convert to Unslgned integerT. 24 
Convert to unsigned integer. 24 
Control Program execution. 340 
Controil progranml execution， 540 
Unary minus, 41 
Unary Plus. 4] 
Unique elements of a vector. 433 
Convert String to Upper case. 62 
Pass a variable Dumber of arguments. 的 
Return a variable number of arguments. 2 
Get MATLAB version number. 48 
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WW 


warning 
While 
whitebg 
Whos 


关 


Xlabel 
X 了 了 由 
XO 
XtICK 


Yabel 
Y 了 Im 
ytICK 


Z 


Zeros 


Display warning message. 


Repeat statements an indefinite number of times. 


Change background color， 
List variabjles in the workspace. 


Label the x-axXis. 

Set Or query X-axis limits. 
Exclusive or. 

9et horlzontal axis tick. 


Label the y-axis. 
9et OT qUery y-axXis limits. 
9et vertical axis tick. 


Generate array of zeros. 


人 Devaeiadagaigl: 训 ee 
pe 


set see 


in thlis appendix we develop the ice jnteractiye color edifing (ICE) function 
introduced in Chapter 6. The discussion assumes tamiliarity on the part of the 
Ieader with the material in Section 6.4. Section 6.4 PITovides many examples of 
usSInDg ice in both pseudo- and full-color Image processing (Examples 6.3 


5 through 6.7) and describes the ice cajlling syntax，input parameters， and 


graphical interface elements (they are summarized in Tables 6.4 through 6.6). 
The power of ice is its ability to let users generate cojor transformation curves 
interactively and graphically，while displaying the impact of the generated 
transformations on images in real or near real time. 


(Creating ICE's Graphical User Interface 


MATLAB'S GrapPpical User Lierjoce Developmaent E1yviroppazerzi (GUIDE) 
Provldes a rich set of tools for Incorporating 8raDpHical rser ilerjacey (GUIS) 
in M-functions, Using GUIDE, the Processes of (1) jaying out a GUI (i.e., its 
buttons, pop-up menus， etc.) and (2) Programming the operation of the GUI 
are divided conveniently into two easilly managed and relatively Independent 
tasks，Ihe resulting graphical M-function is composed of two identically 
named (lgnoring extensions) files: 


二 A file with extension .fig, called a lLC-jile, that contains a complete 
graphical description of all the function's GUI objects or elements and 
their spatial arrangement, A_ FIG-file contains binary data that does not 
need to be parsed when the associated GUILbased M-function is execnut- 
ed. Ihe FIG-file for ICE (ice. figj is described later in this section 

2 Atfile with extension .m, called a GCT -file,which contains the code that 
controls the GUI operation, This file incliudes functions that are called 
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GURE B.1 

he GUIDE 
ayout Edltor 
IOckup of the 
> 已 GUJT 


when the GUI is launched and exited, and caJJpacK jactioms that are 
executed when a user interacts with GUI objects 一 for exXample， when a 
button is pushed.The GUI Mr-file for ICE (ice.m) is describped in the next 


Sectlon， 
Tolaunch GUIDE from the MATLAB command window,tyPe 


guide fiJlename 


where filename is the name of an existing FIG-fijle on the _ current path. 下 
filename is omitted, GUIDE opens a new (ie.,blank) window， 

Figure B.1 shows the GUIDE Zayott Editor (launched by entering guide 
ice at the MATLAB >> prompt) for the Interactive Coier Editor (CE) lay- 
out. The Layout Editor is used to select， place, size, align, and manipujate 
graphic objects on a mock-up of the useT interface under development lbhe 
buttons on its left side form a Cormzpoment Pajette containing the GUI objects 
that are supported- 一 Prspn Brzttons, 7288Le Brtomns,Radio Butons, CRecKpoxes， 
EdirText,Siafic Texfs, Sliders, Fraaey， 7Litpoxes, PopLD Menls,and 4xes. Each 
object is similar in behavior to lts standard Windows' counterpart. And any 
combination of objects can be added to the fgure object in the layout area on 
the right side of the Layout Editor. Note that the ICE GUI includes checkbox- 
es (Smooth, Clamp Ends, Show PDF, Snow CDF, Map Bars, and Map Image), static 
text (Component: ”CuUIVe ， ., a frame outlning the _ curve controjs, two 
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push buttons (Reset and Reset AlL1),apopup Imenu ftor Selecting a color trans- 
formation curve, and three axes objects for displaying the selected curve (with 
associated control points) and its effect on both a gray-scale wedge and hue 
wedge.A hierarchical list of the eljements comprising ICE (obtained by clicking 
the Object Browser button in the task bar at the top of the Layout Edqitor) 
shown in Fig. B.2(a).Note that each element has been given a unlque name OF 
tag. For example, the axes object for curve display (at the top of the list) is as- 
signed the jidentifier curve_axes [the identifier is the first entry after 也 e Open 
parenthesis in 所 iig. B.2(a). 

Tags are one of Seyeral properties that are common to all GUI objects. A 
scrollable list of the properties characterizing a Specific object can be obtained 
by selecting the object [in the Object Browser list of Fig. B.2(a) or ljayout area 
of Fig. B.1 using the Selection Tooll and clicking the Property zaspector button 
on the Layout Editor's task bar. Figure B.2(b) shows the list that is generated 
when the figure object of Fig. B.2(a) is selected. Note that the figure ob- 
ject's Tag property [highlighted in Fig. B.2(b)] is ice. This Property ls Impor- 
tant _ because GUIDE uses it to_ automatically generate figure callback 
function names.Thus, for example,the WindowButtonDownFcn Property at the 
bottom of the scrollable Property Inspector window, which is executed when a 
mouse button is pressed over the figure window， is assigned the narme 
ice WindowButtonDownFcn. Recall that callback functions are merely 
M-functions that are executed when a user interacts with a GUI object, Other 
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2 udconrrol 【cext3 "0ucputyj | ShareColors ee 区 jon 
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: 了 
一 较 uiconcrol (3aooth_ checkbox “Snpoth') 和 
IContextMenu 












[uicontrol (zeset_puUshbuccon “Resec"】 站 e 
1 和 ULCORSSLTOL 【FeXt4 "CUVe"”) 和 人 nls 人 站 国 刘 
本 网 本 | | 一 UserData， 六 本 间 呈 double 夺 Tayl 
2 uiContLID1 (ouLEPUL LeEXC “”) 和 人 隐 人 
中 攻 <-WindowButtonDownFcn 









… 同 uiconcrol (3Lope_checkbox "Clap Ends“》 


Fe Window 理 uton 草 otitonFcn 
… 涡 julcontrol (resecalL_pushPHUEEon “ReSet 及 1 “] ， 


icetjice 





响 uiconcrol (pdf_checkbox “Show EDF ”|) | 
“ 回 uieonczol fcdE_checkhox "Show CDF”| 1 
Econtrol 人 blue_zexc “"] | 
TconEEOL 【geED_EexE “"]】 | 
ConEtOL 《LIeQ_EEXC “”] | 
: 提 G ULECOREFOL {LextLD “3etudGo-Ccolor Bar”] 1 
ip uicontOL (LextLL“TFuil-coLor Bar“1 | 
只 niconcrol (aapbar_checKhox “Map BarS ) | 


uiEcontro1l (aapimage checkbox “HKap 工 电 如 e“} 
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_wWindowButtoniMotionFcn'.gcbo,0guidatatgcbo? 
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。 HGURE B.2 (a)The GUIDE Object Browser and (b) Property Inspector for the ICE figure" object 
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CLTDE Senerafed 
YIG11028 MMRie 






function 
function 
function 
function 
function 
function 
funct1Lon 
function 
function 


notable (and common to all GUI objects) Properties incliude the Position 
and Units Properties, which define the size and locatiou of an object 

Finailly we note that some properties are Unique to particular objects. A push- 
button object, for example, has a Callback property that defines the function ， 
that 1$ executed when the button is pressed and the String property that deter- 
mines the button's label, The Callback property of the ICE Reset button js， 
reset_pushbutton_ Callback [note the incorporation of its Tag Property from 
Pig. B.2(a) in the callback function nameji its String Property is“Reset". Note， 
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component_popup_Callback(hobject，eventdata， hand]es) 
Smooth_checkbox_Callback(hObject， eventdata，hand1les ) 
reset_pushbutton_CalLlback(hObject， eventdata，hand1les ) 
Slope_checkbox_Callback(hobject， eventdata，hand1les) 
resetall_pushbutton_Callback(hobject， eventdata，hand1les ) 
pdf_checkbox_Callback(hobject， eventdata，hand1les) 

cdf _checkbox_Callback(hobject， eventdata，hand1es) 
mapbar_checkbox_Callback(hobject， eventdata，handles ) 
mapjimage_checkbox_Callback(hobject， eventdata，hand1les ) 


howeyer that the Reset pushbutton does not have a WindowButtonMotionFcn 


propertyi it ls Specific to “fgure”objects， 





2 Programming the ICE Interface 


When the ICE FIG-file of the previous section is first saved or the GUI is first_” 
IUD (e.g., by clicking the Razza button on the Layout Editor's task bar),GUIDE 
generates a starting CL7 M-jiie called ice.m.This file, which can be modified 
Using a standard text editor or MATLAB's M-file editor, determines how the  : 
interface Tesponds to user actions. The automatically generated GUI M-file 


for ICE is as follows: 


function varargout = icelvarargin) 


% Begin initialization code - DO NOT EDIT 
guUlL_ Singleton = 1 
gul_State = Struct( gui _Name ' ， mfiilename，，,， 


guil Singleton' ， gui Singleton， 
gui openingFcn ， Qice 0peningFcn， 
gui outputFcn ， 8ice 0utputFcn， 
gui LayoutFcn' ， []，,,， 
gul_Callback ' ， 站 入 
if nargin & ischarf(varargin{f1}) 
gui_State,gui_Callback = str2func(varargin{f1y) 
end 
f nargout 
[varargout{1:nargout}] = gui_ mainfcn(gui_ State，vararginf:}); 
eJSe 
gui_mainfcn(gui State，vararginf:}); 
end 
s End initialization code - DO NOT EDIT 
function ice_0peningFcn(h0bject，eventdata，handles，varargin) 
handles.output = hObject， 
guidata(hobject，handles ) ; 
% Uiwait (hand1es ,figure1) ; 


function varargout = ice_0utputFcn(h0bject，eventdata， hand1les) 
varargout{f1) = handles.output; 
function ice_WindowButtonDownFcn(hobject，eventdata，handles) 


function ice_WindowButtonMotionFcn(hobject，eventdata，hand1es) 
function ice_WindowButtonUpFcn(hobject，eventdata， handjles ) 






汪 瑟 和 
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This automatically generated file is a useful startIing point or prototype for the 
develjopment of the fully functional ice interface. (Note that we have stripped 
the fle of many GUIDE-generated comments to Save Space.) In the sections 
that follow, we break this code into four basic sections: (bthe initialization code 
between the two “DO NOT EDIT”comment lines， (2) the figure opening and out- 
put functions (ice_0peningFcn and Ice_QutputFcn), (3) the figure callback 
functions (ie.，the 1ce_WindowButtonDownFcn，ice _WindowButtonMotion- 
Fcn,and ice_WindowButtonupFcn functionsj, and (4) the object callback func- 
tions 《e.g.，reset_pushbutton_Callback)， When consldering each section， 
compjletejy developed versions of the ice functions contained in the section are 


given, and the discussion is focused on features of general interest to most GUI 


M-flle developers. The code introduced in each section will not be consolldated 
(dor the sake of brevity) into a single comprIehensive listing of ice.m.Itis intro- 
duced in a Piecemeal manner. 

The operation of ice was described in Section 6.4. It is also SumImaIIZzed in 


_the folowing Help text block frorn the fully developed ice.m M-function: 


和 多 ICE Interactive Color Editor.， ice 


和 
名 OUT = ICE('Property Name ， "Property Value'，,.,) transforms an Peip iexfbiock of 
5 image S$ Color components based on interactively specified mapping jlL YersioP 

各 functions，Inputs are Property Name/Property Value pairs: 

各 

竺 Name Value 
和 

和 Image: An R68 or monochrome jinput image to be 

和 transformed by interactively specified 

和 mappings， 

多 “Space The Color Space of the components to be 

和 5 modified，Possible values are 二 

和 hsi ，'hsv' ，'ntsc' (or yiI9 )， "ycbcr' ，When 

和 omitted，the RGB color space is assumed， 

务 Wait， If on” (the default)，0OUT is the mapped input 

和 Image and ICE returns to the calling function 

上 or Workspace When closed，If 'off'，0OUT is the 

和 5 handJje of the mapped input ;image and ICE 

多 returns immediately， 
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Ge oa oo 吕 


oo 


ce oo o 避 


ee ce 


oo 避 oo ooce 


co 


op oo2 


oo oo 吕 


co cb 吧 品 


Go oo 品 


op op oaeoo oo co oo oo oo 吧 co oo oo 


QOS 


EXAMPLES : 


ice 0R icef('wait' ，' off ) 


ice( image ，T) 


icel' inage'，f，'Space ， hsv ) 
g = icel( imnage 二) 
g = ice( imnage'，Tf，'wait ， 0off )) 


% Denmo USen interface 
% Map RGB or mono image 
% Map HSV of RGB image 
% Return mapped lmage 
%5 Return its hand]1e 


ICE displays one popup menu Selectable mapping function at 8 
time，Each jimage component is mapped by a dedicated curve (e.g.， 
R，G，or 8B) and then by an alI-Component Curve (e.g,.，R6GB)， Each 
curve's control points are depicted as Circ1e6s that Can be moved， 
added，or deleted With a two- or three-button mousSe: 


MouSe Button 


加 丰 亩 和 四 四 党 本 和 四 mm 只 亚 四 


Lef 
Midd]1e 


RIght 


Edjiting 0peratiIion 


Move control point by pressing and dragglng， 
Add and position a control point by pressing 
and dragging， (Optionally Shift-Left) 

Delete a control point， (Optionally 
ControlL-Left ) 


Checkboxes determine how mapping functions are Computed，whether 
the ;input image and reference psSeudo- and fu11L-color bars are 
mapped，and the displayed reference CUFVe information (8e,g,， 


PDF ) ; 
Checkbox 


Smooth 


CLamp Ends 


Show PDF 


Show CDF 


Map Image 


Map BarSs 


Function 

Checked for cubic spLine (Smooth CUrVe) 
interpolation。. If unchecked，piecewise linear， 
Checked to force the Starting and ending CuUrve 
slopes in cubic spline interpolation t0 0，No 
effect on piecewise linear. 

Display probability density function(S) [ 寺 ,86,， 
histogram(sS)] of the image Components affected 
by the mapping function， 

Display cumulative distributions functionf(S) 
instead of PDFs， 

<Note:' Show PDF/CDF are mutuallLy exCluSivVe.> 
If checked，jimage mapping is enab1ed; 681Se 
not . 

If checked，pseudo- and ful1-color bar mapping 
is enabLed，else display the unmapped pars (a 
gray wedge and hue wedge，respectiyvely) ， 








Mapping functions Can be initialized Via puShbuttons: 


多 

务 
:和 Button Function 
生生 入 生生 生生 生生 生生 生生 全 生生 生生 生生 入 入 全 入 生生 生生 入 生生 生生 全 宝生 
和 ResSet Init the currently displayed mapping function 
5 and uncheck all Cufrve parameters， 
和 Reset AI Initialize al mapping functIons， 


了 .2.4 Initialization Code 


The opening section of code in the starting GUI M-file (at the beglinning of 
_Section B.2) is a standard GUIDE-generated block of initialization code. Its 
- purpose is to build and display ICE'S GUI using the Mrfile's companlon 下 IG- 
file (see Section B.1) and control access to all internal M-file functions. As the 
_enclosing“D0 NOT EDIT”comment lines indicate, the initiajizatlon code 
should not be moditied. 卫 ach time icels called,the initialization biock builds 
a Structure called gui_State, which contalns information for accesSsing Ice 
functions. For instance, named field gui_ Name (ie., gui_State.gui Name) 
contains the MATLAB function mfilename,，which returns the name of the 
currently executing M-file. In a similar manner, fields gui_ OpeningFcn and 
gui_OutputFcn are loaded with the GUIDE generated names of Ice s Open- 
ing and output functions (discussed in the next Section). 了 an ICE GUI object 
is activated by the user (e.g.,a button is pressedj,the name of the object's call- 
back function is added as field gui_Callback [the callback's name woujd 
have been passed as a string in varargin(1+)j. 

After structure gui_State is formed, it 18 passed as an input arguInent， 
along with varargin(i),to function gui_mainfcn. TIhis MAILAB function 
handles GUI creation, layout, and callback dispatch. For ice,it builds and dis- 
plays the user interface and generates all necesSary cajls to its opening, Outpnut， 
and callback functions. Since older versions of MATLAB may not include thbis 
function, GUIDE is capable of generating a stand-alone version of the normal 
GUI M-fiie (ie., one that worKs without a FIG-file) by sejecting 了 xport . ，， 
from the File menu. In the stand-aione version,function gui_mainfcn and two 
Supporting routines, ice_LayoutFcn and 1ocal_openfig,are appended to the 


normally FIG-fiile dependent Mrfile. The role of ice_LayoutFcn 1S to create 


the ICE GUI. In the stand-alone version of ice,it begins with the Statement 


hh1 = figure(..， 


Units'， "characterSs' ，,，， 


| 'Color' ，[0.87843137254902 0.874509803921569 0.890196078431373] ,，. ,. 
Colormap ，[0 0 0.5625;0 0 0.625;0 0 0.6875;0 0 0.75;，,， 


2 
人 


0 0 0.8125;0 0 0.875;0 0 0.9375;0 0 1;0 0.0625 1;.,. 

0 0.125 1;0 0.1875 1;0 0.25 1;0 0,3125 1;0 0.375 1;..， 
0 0.4375 1;0 0.5 1;0 0,5625 1;0 0.625 1;0 0.6875 1;.,. 
0 0.75 1;0 0.8125 1;0 0.875 1;0 0.9375 1;0 1 13;，,， 
0.0625 1 1;0,125 1 0.9375;0.1875 1 0,.875;,,， 


多 
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0.25 1 0.8125;0.3125 1 0.75;0.375 1 0.6875j; . .,， 
0.4375 1 0.625;0.5 1 0.5625;0.5625 1 0.5;..， 
0.625 1 0.4375;0.6875 1 0,375;0.75 1 0.3125;.,， 
0.8125 1 0.25;0.875 1 0.1875;0.9375 1 0.125;,,， 
1 T0.0625,1 -0 .0.93975.0110,.875 031 0.81295 0 
1 0.75 00.6875 .01 .0.625.0)1 0.5625 071.0.9 0 .vs。 
1 0.4375 0;1 0.375 0;1 0.3125 0;1 0.25 0).,， 
10.1875 0;1 0.125 0;1 0,0625 0;1 0 0j0.9375 0 0 ... 
0.875 0 0;0.8125 0 0;0.75 0 0;0.6875 0 0;0,.625 0 0;i，,， 
055625 :0 5 
IntegerHandJe'  ， "off ，... 
InvertHardcopy ，get(0， defaultfigureInvertHardcopy ) ，，,， 
MenuBar ， none ，,.， 
Name ，'ICE - Interactive Color Editor ，,，， 
-NumberTit1e ， 0fTf ，，，,， 
'PaperPosition' ，get(0， 'defaultfigurePaperPosit1ion )，.,， 
"Position'，[0.8 65.2307692307693 92.6 30.0769230769231] ,.,， 
Renderer ，get(0，'defaultfigurehenderer )，.,， 
RendererMode ， manual ，.，,. 
风 indowButtonDownFcn ， "ice( ice WindowButtonDownFcn  ，gcbo，[]，，,， 
guidata(gcbo) ) ，, .， 
WindowButtonMotionFcn ， Ice( ice WindowButtonMotionFcn “ ，gcbo,，.， 
[] ，guidata(gcbo)) ，, .， 
WindowButtonuUpFcn ， ice( ice WindowButtonuUpFcn  ，gcbo，[],..,， 
guidata(gcbo)) ，,.， 
HandJleVisibility ， callback' ，,，,， 
Tag ， 1C8 ，,，,， 
UserData ，zeros(1;0) ) ; 


to create the maln figure window.GUI objects are then added with statements jike 


h13 = Uicontrol( .,，， 

本 

UnitSs ， normalized ，.，. 

Callback ， ice( reset_pushbutton_ Callback “ ，gcbo， 
guidatal(gcbo)) ，,,， 

-FontSiIze ，10,，.，,， 

LiISstboxTop'` ，0,，... 


[] 


-Position'` ，[0.710583153347732 0.508951406649616. ,， 
0.211663066954644 0.0767263427109974j ，,,， 

String ， Reset ' ，.. ， 

Tag ， Feset _ pushbutton ) ; 


which adds the Reset Pushbutton to the figure. Note that these statementfs 
specify explicitly properties that were defined originally using the Property In- 
spector of the GUIDE Layout Editor. Finally, we note that the figure funcC- 
tion was introduced in Section 2.3: uicontrol creates a user interface control 






和 % Set ICE gl1obals to defaults， 





E 


| handles,curve = 'set1 
生 。 handles.cindex = 1 
和 handjles， 


， handles,output 
，handles ,df = []; 

: handles,colortype = 
， handles.input = [; 

， handles ,imagemap = 1， 
2 handles.barmap = 1 
:handles,.graybar = 『[]， 
3: handles.colorbar = []; 
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(ie, GUI object in the current figure window based on property name/value 
pairs (e.g.，Tag' Plus 'reset_pushbutton') and returns ahandle to it 


8.2.2 The Opening and Output Functions 


The farst two functions following the initialization block in the starting GUI 
Mr-file at the beginning of Section 昌 .2 are called opeming and oripuL 记 actio7y， 
Tespectively. Jhey contain the code that is executed just before the GUI is 
made Visible to the user and when the GUI returns its output to the command 
line or calling Toutine. Both functions are passed arguments hO0bject， 
eyentdata, and handJjes. (Hihese arguments are also inputs to the callback 
functions in the next two sections.) Input hobject is a graphics object handle， 
eventdata ls reserved for future use, and handles is a structure that provides 
handjes to interface objects and any application specific or user defined data. 
To Implement the desired functionality of the ICE interface (see the Help 
text), both ice_0peningFcn and ice_OutputFcn must be expanded beyond 
the “barebones”versions in the starting GUI M-file. The expanded code is as 
follows: 


function ice_0peningFcn(h0bject，eventdata，handles， varargin) 
多 When ICE is 0pened，perform basic initialization (e,g,， Setup 
区 glLobals，,,.) before 让 is made visible， 


Ice_0OpeningFcn 
0 隐 


Fropz ipe Paat 
dd-Fe. 


handles.updown = “none'， Mouse Updown State 
handles.plotbox = [0 0 1 1T]; Plot area parameters in pixe1S 
handles,set1 = [0 0; 1 1]; Curve 1 control points 
handles,Set2 = [0 0; 1 1]， Curve 2 control points 
:handles,set3 = [0 0; 1 1T]; Curve 3 control pointSs 
-handles,set4 = [0 0; 1 1]; Curve 4 control points 


Structure nanme of Selected CUPve 
Index of Selected Curve 


所 


】 


鲍 node = 0; Index of Selected control point 
.handles,below = 1 Index 0f node below control point 

记 handles,above = 2， Index of node above Control point 

起 handles.smooth = [0; 0; 0， 0]; Curve Smoothing States 

| handles,Slope = [0; 0; 0; 0]; curve end S10pe control StateSs 

，， handles,cdf = [0; 0; 0; 0]; Curve CDF states 

4 hand]les,pdf = [0; 0; 0; 0]; CUrve PDF States 


output image handje 
Input PDFs and CDFSs 
Input image Color Space 
Input image data 

Image map enab1e 

Bar map enabjle 

Pseudo (gray) bar image 
多 COIor (hue) bar imnage 


[] ; 
0 


oo oo 


s Process Property Name/Property Value input argument pairs， 


Wait = “0n '; 





536 ”Appendix8 演 ICE and MATLAB Graphical User Interfaces 


if (nargin > 3) 
for = 1:2:(nargin - 3) 
if nargin 一 3 == 工 
break ; 
end 
Switch 1ower(varargin{I}) 
Case ， image 
if ndims(vararginfi + 1}) == 
handles.input = Varargin{I + 1} 
elseif ndims(varargin{fi + 1}) == 
handles.input = cat(3，varargin{fi + 1}，，,，,， 
vararginfi + 1}，varargin{tI + 1 ) 
end 
handles ,input = double(handles.input) ; 
inputmax = max(handles.input( 2:)) 
计 inputmax > 255 
handles,input = handles,input / 65535 ; 
elSselif Inputmax > 1 
handles,input = handles.input / 255 
end 


Case Space 
handles,colortype = lower(vararginfI 二 1) 
SWitch hand1es .CoOLOrtype 
CasSe “CCmy 
list = {ICMY' 'Cyan' Magenta “YeLLOW 】) 
case fntsc' ， yidq } 
list = {YIQ: 'Luminance' Hue' “Saturation 了 
handles.colLortype = “ntsc 
Case “YCbcP 
list = {YCbcr' "Luminance “8B1LUe ”.，， 
'Difference' 'Red Difference j}; 


case :hsv， 

list = {f'HSV' :Hue' "Saturation Value 上 
Case “hsil 

List = {f'HSI' 'Hue' 'Saturation' “Intensity 二; 
OtherW1Se 


list = {'RGB' 'Red'  'Green” Blue }; 
handles,colortype = “rgb 

end 

set{handles.component_popup， "String ，]1ist) ; 


Case “Wait- 
Wait = LIower(varargin{f1L + 1)}) 
end 
end 
end 


% Create pseudo- and full-color mapping bars (grays and hues) ，Store 
% a color space Converted 1x128x3 line of each bar for mappilng， 

X 症 二 人 和 X=0:1/6:1); 全 

是 生 人 0 革 下 省 寺 

gb = frepmat(xi，[1 1 9 cb = interp1dq(X，y， XI ) 
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cb = reshape(cb，f1 128 3]) 

计 ~StFcmp(handles.colortype， rgb ) 
gb = eval(['rgb2' hand1es.colortype (gb) ]) 
cb = eval(['rgb2' hand1es.coLortype (cp) ] 


min({255，gb) ; 
inft255，cb) ; 


gb = max(0，gb) gb 
cb = max{(0O，cb ) cb 
hand1les,colorbar = cb 


所 


handles.graybar = gb; 


% Do color space transforms，CLamp to [0，255]j，compute histograms 
% and cumulative distribution functions，and create Output figure， 
计 Sizefhandles,input，1) 
if -strcmp(hand1les.colortype， rgb ) 
handles.input = eval([' rgb2' handleS.COLortype 
(handles ,input) ]); 
end 
handles.input = round(255 * handles,input) 
handles.input = max(0，handles ,input) 
handles ,input = min(255，handles,input) 
for 1I1 = 1:3 
color = handles,input(:，:，I 工 ) ) 
df = hist(color(:)，0:255) ; 
handles.df = [handles.df; df / max(df(3))]) 
df = df / sunm(df(:)); df = cumsunm(df)i 
handles,df = [handles,df; df]; 
end 
figure 
end 


handles.output = gcf ; 


% Compute ICE's Screen position and display image/graph 
set(0，'Units' ， pixelS ) ) SSZ = get(0，'ScreenSslze ) 
set(handles,ice，'Units'， ' pixelS ) 
uisz = get{fhandles.ice，" Position ) ; 
if sizelhandles,input，1) 

fsz = get(hand1es.output， "PoSsition ) 

bc = (fsz(4) - Uisz(4)) /3 


DC 
bc = bc + fSz(2); 
el1Se 
bc = fsSz(2) + fsz(4) - Uisz(4) - 10; 
end 
lc = fsz(1) + (size(handles,input，2) 1/ 4) + (3 “ fsz(3) / 4) 
lc = min(1cC，sSSz(3) - Uisz(3) 一 10) 
set(handles.ice，'Position ，[JCc bc 463 391] ) ; 
e1Se 
bc = round((SSz(4) - Uisz(4)) / 2) 一 10; 
lc = round((sSsz(3) - Uisz(3)) / 2) - 10 
set{hand1les,ice，'Position' ，[1C bc UISZL(3) UiISz(4) ] ) |; 
end 
set{handles.ice， Units ， normalized ) ; 
graph(hand1es) ; Penderfhand]1es ) 


5 
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+Ce_OutputFcn 
接 和 rn 


“7OPT OPe 语 ra 
-Fle， 


s Update handles and make ICE wait before exit if required. 
guidata(hobject，hand1es ) ; 
if Strcmpi(wait ， "on ') 

Uiwalit(handles.ice); 


function varargout = ice_0utputFcn(hobject，eventdata，handles) 

5 After ICE is Cl0sed，get the image data of the current figure 
for the output，If handles' existSs，ICE isn't Closed (there was 
5 no Uiwait ' ) So output figure hand1e， 


if max(Size(handles)) == 
figh = get(gcf); 
imageh = get(figh.Children); 
jif max(Sizefimnageh)) > 0 
image = get(imageh.Children) ; 
varargoutt1} = image.CData; 
end 
8e1Se 
varargout{1}+ = hobject; 
end 


Rather than examining the intricate details of these functions (see the code% 
comments and consujlt Appendix A or the index for help on Specific func- 
tons), we note the following commonalities with most GUI opening and out- 
put functions: 


1 TIhe handles structure (as can be seen from its numerous references 记 
the code) Plays a central role in most GUI M-files, It serves two crucial 
functions. Since it provides handies for all the graphic objects in the inter- 
face, it can be used to access and modify object Properties. For instance， 
the ice opening function uses 


Units ， “pixels' ) ; 
“Position' ) ; 


set{(hand1les,ice， 
UISZ = get(handles.ice， 


to access the Size and location of the ICE GUI (in pixels). This is accom- 
plished by setting the Units property of the ice figure, whose handle is 
available in handles ,ice, to 'pixels' and then reading the Position 
PpPIOperty of tbhe figure (using the get fnction), The get function, which 
returns the value of a property associated with a graphics object, is also 
Used to obtain the computer's display area via the ssz = get(0， 
Screensize' ) statement near the end of the opening function. Here, 0 is 
the handle of the computer display (ie.,root figure) and 'Screensize' is 
a property containing its extent， 

in addition to providing access to GUI objects,the handles structure is 
apowerful conduit for sharing application data.Note that it holds the default 
values for twenty-three global ice Parameters (ranging from the mouse 
state in handles,updown to the entire inputimage in hand1les， input).Ihey 
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must SUIVIVe eyery call to ice and are added to handles at the start of 
ice_0peningFcn. For instance, the handles.set1 global is created by the 
stateIment 


handles.Set1 = {f00; 11] 


where set1 is a named fieid containing the control points of a color map- 
ping functlon to be added to the handles structure and [0 0; 1 1] isits 
defaujt value [curve endpoints (0, 0) and (1, 1)]. Before exiting a function 
im which handles is modified， 


guidata(hobject，hand1les ) 


must be cajlled to store variable handles as the appjlication data of the fig- 
ure with handle hob j ect. 


Like many built-in graphics functions, ice_OpeningFcn Processes inpnut 
arguments (except hO0bject, eventdata,and handles) in property name 
and value pairs. When there are more than three input arguments (i.e., 让 
nargin > 3), a Joop that skips tbhrough the input arguments in pairs [for 
ii=1:2:(nargin-3)]isexecuted.For each pair of inputs, the first is used 
to drlive the Switch construct， 


Switch Jower(varargin{fIr 


whlch processes the second parameter apPIopriately For case 'space'， 
for instance, the statement 


handles,colortype = lower(varargin{fi + 1T}); 


Sets named fieljd colortype to the value of the second argument of the 
input pair. This value is then used to setup ICE's color component popup 
options (il.e.,the String PIOperty of object component_popup).Later,it is 
used to transform the components of the input image to the desired map- 
ping Space Via 


hand1les,input = eval(['rgb2， 
handles.colortype “(handles.input) ]); 


where builtin function eval(S) causes MATLAB to execute string s as 
an expresslon OF Statement (see Section 12.4.1 for more on function 
eval). 过 handles.input is 'hsv',for example,eval argument [rgb2， 
Phsv (handles.input)'] becomes the concatenated string 
rgb2hsv(handles,input)' ,which is executed as a standard MATILAB 
expresslon that transforms the RGB components of the input imagse to the 
也 >V color Space (See Section 6.2.3). 
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3。 Thpe statement 
55 ULIwalit (handjes.fTigure1) ; 
in the starting GUI M-Hjle ls converted into the conditional statement 


If strcmpi(wait， on' ) uUliwait(handles.ice);i end 


In the final version of ice_0peningFcn. In general， 
ULIWait (fig ) 


blocks execution of a MATLAB code stream Until either a Uiresunme is 
executed or fgure fig is destroyed (ie., closed), [With no input argu- 
ments, Uiwait ls the Same as Uiwait(gcf) where MATLAB function gcf 
returns the handle of the current figure]. When ice is not expected to re- 
turmn a mapped version of an input image,but return immediately (i.e., be- 
fore the ICE GUI is closed)，an input Property name/value pair of 
Wait' /off' Imust be inciuded in the call. Otherwilse, ICE wiil not re- 
turn to the caljing routine or command line until it ls closed. That is, until 
the user js finished interacting with the interface (and color mapping func- 
tions),， In this Situation，function ice_0OutputFcn can not obtain the 
mapped Image data from the hand1les structure, because it does not exXist 
after the GUIJ is closed.As can be Seen in the final version of the function， 
ICE extracts the image data from the CData Property of the Surviving 
Iapped image output figure. If a mapped output image js not to be re- 
turned by ice,the uiwait statement in ice_ 0peningFcn is not eXecuted， 
ice_0outputFcnls called imnmediately after the opening function (long be- 
fore the GUIis cljosed),andthe handle ofthe mapped image output figure 
is returned to the cajlling routine or command line. 


Finally，we note that several internal functions are invoked by 
ice_0peningFcn. These 一 and all other ice internal functions 一 are jisted 
next. Note that they provide additional exampjles of the usefulness of the 
handles Structure In MATELAB GUISs. For instance, the 


nodes = getfieldf(handles，handlLIes.curve ) 


and 


nodes = getfieldf(lhand1les，[ ' Set'” num2str(I)]) 


statements in internal functions graph and render, Tespectively are used t0 
access the interactively defined control points of ICE's various color mappling 
cuUIVes. In its Standard form， 


F = getfield(S，field  ) 


returns to F the contents of named fleld field' from structure S. 
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function graph(hand1lesS) ice 
% Interpolate and plot mapping functions and optional reference Wi 
%% PDF(S) or CDF{S). 


nodes = getfield(handles，hand1les.curve) ; 
c = handles.cindex; dfx = 0:1/7255:1; 
colors = [k Fr gb]; 
% For pliecewise Linear interpolation，plot a map，map + PDF/CDF，or 
% map + 3 PDFS/ACDFS ， 
计 ~handJjes.smooth(handles.cindex) 
if (~handles,pdf(c) & -handles.cdf(Cc)) | ，,， 
(Size(handles.df，2) == 0) 
plot(nodes(:，1)，nodes(:，2)，'b-'，,， 
nodes(:，1)，nodes(:，2)， "KoO' ，..， 
-Parent ，handles.Ccurve axes) ; 
el]Seif C> 1 
II=2*CcG-~2-handles,pdf(c); 
plot(dfx，handles.df(i，:)，fcolors(c) “一 ] ，,.， 
nodes(:，1)，nodes(:，2)，“'k 一 ， 
nodes(:，1)，nodes(:，2)， 'ko' ，，,， 
-Parent ，handles,Curve axes ) ; 
elLSeif C == 
II= handles.cdf(c); 
plot(dfx，handles,df(i+ 1，:)， fr- ，.，,， 
dfx，handles.df(i+ 3，:)， "9g-' ，,,，. 
dfx，handles.,df(I+ 5，:)， 'b- ，.，,， 
nodes(:，1)，nodes(:，2)，'k- ，,,， 
nodes(;，1)，nodes(:;，2)， :ko ，,.， 
-Parent ，handles,Curve_axeSs ) ; 


Fial AM-Fie reraalt 
万 5cHOPS 


end 


5 Do the Same for Smooth (cubic Spline) interpolations ， 
el1Se 
X=0:0.01:1; 
if ~handles,SLopefhangd1es.cindex) 
yY = S$pline(nodes(;，1)，nodes(:，2)，Xx) ; 


elLSe 
y = S$plinefnodes(:，1)，[0; nodes(:，2); 0]，Xx) 
end 
1I1=Tfindty > 1); y(1LI) = 1; 
TI=find(y<0); y(1) = 0) 


if (~handles,pdf(c) & ~handles.cdf(C)) | ，,， 
(Sizelhandles.df，2) == 0) 
plLot(nodes(:，1)，nodes(:，2)， ko ，X，yY， "b- ， 
Parent ，handJes,Curve_axeSs ) ; 
elSelif C > 1 
1I1=2*xC--2-handles,.pdf(c)i 
pljot(dfx，handles,df(I，:)，[colors(Cc) 一 ]，,..， 


nodes(:，1)，nodes(:，2)，'ko'，X，YyY， "KK-' ， 
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Parent ' ，handles.cuUrve _axes) ; 
elSelif C == 
1 = handles,.cdf(c) 
plot(dfx，handles.df(i+ 1，:)， Pr-， 


dfx，handles.df(Ii+ 3，:)， 9g-， 
dfx，handles.df(Ii + 5，:)， b- ，. 
nodes(:，1)，nodes(:，2)， ko ，Xx，y， kk- ， 
Parent ' ，handles.curve_ axeSs) j 
end 
end 
% Put legend if more than two curves are Shown， 
s = handles.colortype ; 
if Strcmp(Ss， ntsc ) 
3S= yiq ; 
end 
if (C== 1)& (handles.pdf(c) | handles.cdf(c)) 
S1=[-- ”Upper(sS(1))] 
if Length(s) == 
S2 = -upper(s(2))]) sS3 = [-- upper(s(3))]; 
esSe 
sS2=['-- "upper(S(2)) sS(3)]) sS39= [| -upper(sS(4)) St5)1) 
end 
e]LSe 
1 S2= 5 
end 
Set(handles.red text， :String ，S1) ; 
set(hand1les,green text， String ，S21) 
set(handles.blLue text， String ，sS3) | 
%--------- -aa 和 


function [inplot，x，y] = cursor(th，hand1es ) 

% Translate the mouse position to a coordinate with reSpect to 

5 the current plot area，check for the mouse in the area and jif SO 
% Save the location and write the coordinates below the plot . 


set(lh， ' Units ， " pILIXxels ) ; 
P= get(lh， CurrentPoint ) ; 
x= (pl1，1) ~ handles.pLotbox(1T)) / handJles.plotbox(3) 
y= (pl(1，2) - handles.pLotbox(2)) / handies.pLotbox(4) 
ifx>1.05 | xx< -0.05 |yY> 1.05 | y< -0.05 

inplot = 0 
el1Se 

X= min(x，1); XxX = max(X，O0) 

yY = minty，1); y = max(y，0) 


nodes = getfield(handles，handles.CcuUrve ) 

X = _ round(256 * X) / 256; 

InpJlot = 1; 

set(hand1les.input text， "String ，num2sStr(Xx，3)) 1 
set(handles.output text， ' String ，Pnunm2strty，3)) 1 
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UnitSs ， normalized ' ) ; 


function y = _ render(handles ) 
先 Map the input image and bar components and convert them to RGB 
先 (if needed) and display. 


set(handles.ice， 'Interruptipb1e'， 
set(handles.ice， ' Pointer '， 
ygb = handles.graybar， ycb = hand1les.colorbar; 
Yi = handlies.input ， mapon = handjes.barmap， 
imageon = handles.imagemap & Size(handles,input， 1) ; 


for 1 = 2:4 
nodes = getfield(handles，['set' num2str(i)]); 
t = lut(nodes，handles.smooth(i)，handles,slope(i)); 
If Imageon 


下 下 
watch ' ) ; 


yI(:， : 二 1 ) 芝 YU :1 Rs 1 ) 直 1 ) ; 
end 
LIf mapon 

ygb(:，:， 一 1) = t(ygb(:，:，I 一 1) + 1T); 

ycb(:，':，i 一 1) =t(Yycbl:，:，1i--1) +1); 
end 


end 
t = Jut(nandles,set1，handles.smooth(1)，handles.slope(1))， 
if imageon 
人 
end 
If mapon 
yYgb = t(Ygb + 1 )) 
end 


ycb = 十 (Ycb + 1); 


if ~sStrcmp(handles.colortype， 'rgb ') 
堪 f size(handles.input， 1 ) 


ylL1 = YI / 255; 
yl = eval([Ihand1les.colortype 2rgb(yIi) ]) |; 
y1 = Uint8(255 * Yi) ; 
end 
ygb = ygb / 255 ; ycb = ycb / 255; 
ygb = eval([handles.colortype 2rgblygb) ]); 
ycb = eval([hand1les.coLlortype 2rgb(ycb) ] ) 
ygb = Uint8(255 * ygb) ; ycb = Uint8(255 * ycb); 
else 
YI = Uint8(yi) ， ygb = Uint8(ygb) ; yYcb = Uint8(ycb ) ; 


end 


if Size(handles.input，1) 
figure(hand1les.output ) ; 

end 

yYgb = repmat(ygb，[32 1 1]); 

axes(handles.gray axeSs) ， 

axes (hand1les.colLor axes ) ; 

figure(handles.ice)，; 


Imshow(yz) ; 


ycb = repmat(ycb，[32 1 1])， 
imSshow(ygb ) ; 
ImSshowf(ycb ) ; 
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“arrow ) 
on 1 


set(handies,ice， 'Pointer ， 
set(handles.ice， 'Interruptibjle ， 


function t = Lut(nodes，Smooth，SL0ope ) 

Create a 256 element mapping function from &a Set of control 
points， The output values are integers in the interval [0，255] ， 
Use piecewise Linear or cubic SpLine With or Without zero end 
Slope Interpolation， 


LI =0:255; 


Go do go 避 


t = 255 * nodes j 
1Lf ~Smooth 

二 [2562256] RD 
eLSe 


if ~Sjlope 
1 
elSe 
t = Spline(t(:，1)，[0i tt(:，2); 0]，1 二 ); 
end 
end 
t = round (七 ) OO t = min(255， 十 ) 


function out = Spreadout (inl) 
和 5 Make all x values unique ， 


% Scan forward for non-unique x's and bump the higher indexed X-- 
% but don't exceed 1， Scan the entire range， 
nudge = 1 / 256; 
for = 2:Size(in，1) 一 1 

if in(I，1) <= in(1 一 1，1) 

in(zi，1T) = min(in(I 一 1，1) + nudge，1); 

enqd 

end 


% Scan in reverse for non-unique x's and decrease the 1Lower indexed 
%Xx -- but don't go below 0. Stop on the first non-unique pair， 
if infend，1) == in(end 一 1，1) 
for = Size(in，1):-1:2 
if in(Ii，1) <= in(I 一 1，1) 
in(i-1， 1 = max(in(iz 1) 一 nudge，0) 
el1Se 
break ; 
end 
end 
end 


% If the first two x's are now the Same，init the CUrve， 
if in(f1，1) == in(2，1) 
in= [00;) 11] 
enad 
Out = 革 n; 





function g = rgb2cmy(T) 
% Convert RGB to CMY Using IPT's imcompLement， 


dg = imcompJement(Tf) ; 


function g = Chmy2rgb(Tf) 
% Convert CMY to RGB using IPT'S imcomp1lLement， 


g = ImcomplLement (了 ) ; 


B.2.3 Figure Callback Functions 





The three functions immediately following the ICE opening and closing func- 
tions in the starting GUI M-file at the beginning of Section B.2 are 户 grre ca 由 
packs ice_WindowButtonDownFcn， ice WindowButtonMotionFcn， and 
ice_WindowButtonupFcn. In the automatically generated M-file, they are 
jpctiomn StD5 一 that js MAITLAB function definition statements without 
Supporting code. Fuly developed versions of the three functions, whose joint 
task 1s to process Inouse events (clicks and drags of mapping function control 
polnts on ICE'S curve_axes object), are as follows: 


function Ice_WindowButtonDownFcn(hobject，eventdata，handles) 

各 Start mapping function Control point editing. Do move，add，or 

委 delete fonr left，middle，and right button mouse ClLicks ('normal '， 
55 “extend ，and 'alt' cases) Over plLot area， 





set{(handles,cUrve _axeSs， Units'， 'pixe1S' ) ; 
handJes,plotbox = get(hand1es,CcUrve _axeSs， 'Position' ) ; 
攻 set(handles,curve _axes，'Units'，'normalized') 
国 [inplLot，x，y] = cursor(h0bject，hand1es') ; 
1 计 inplot 
加 nodes = getfield(handles，handles,.curve) ; 
II= find(x >= nodes(:，1)); beJow = max( 工 ) ; 
above = mint(below + 1，Sjizeflnodes，1)) | 
if (x - nodes(below，1)) > (nodes(above，1) -~ XI) 
node = above; 
el1Se 
node = beLoW; 
endl 
deletednode = 0; 


sWitch getfhobject， "` SelectionTYpe ') 
Case “normal- 
if node == above 
above = min(above + 1，Size(nodes，1)); 
elJSeif node == bejJow 
below = max(below 一 1，1) 








业 - 
人 


、 end 
If node == Size(nodes，1) 
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beJlow = above ; 
elSeif node == 
above = belLowW; 
end 
if x> nodes(above，1) 
X = nodes(above，1) ; 
elsSeif X < nodes(below，1) 
Xx = nodes(below，1T) 
end 
handles.node = node; hangdJjes.updown = “down ; 
handles.below = below; handles,above = above; 
nodes(node，:) = [X yj]; 
CasSe “extend 
if ~Jengthffind(nodes(:，1) == X)) 
nodes = [fnodes(1:below，:); [x y]; nodes(above':end，:)]; 
hand1es.,node = above; handjes ,Updown = down ' ; 
handjes,below = belowj ”handles,above = above + 1 ; 
end 
CaSse alt 
if (node ~= 1) & (node -= Sizefnodes，1)) 
nodes(node，:) = []; deletednode = 1; 
end 
handles.node = 0; 
Set(handjes.,input text， String ， ) 
set (handles.output text， "String' ， 和) |; 
end 


handles = Setfield(handles，handles,curve，hnodes ) ; 
guidata(hobject，hand1es ) ; 
graph(hand1les ) ; 
if deletednode 
render(handles) ; 
end 


end 


人 


function ice_WindowButtonMotionFcn(hobject，eventdata， hand1es ) 
bo nothing unless 3a mouse 'down' event has occurred， If it has， 
当 modify control point and make new mapping function， 


if ~Strcmpilfhandles.updown， 'down ' ) 


Peturn 


end 


[inplot，x，yj = cursor(hobject，handles ) ; 
ij Inplot 
nodes = getfield(handles，handljes,curve); 
nudge = handles,smooth(handles,.cindex) / 256; 
if (handles,node ~= 1) & (handles.node -= sizelnodes，1)) 
if xX >= nodes(handles.above，1) 
Xx = nodes(thandJes ,abovej 1) -nudge; 
elseif X <= hodeslhand1es,below，1) 
x = nodes(handles.below，1) + nudge， 
end 


0 
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elSe 
if Xx > nodes(handles.above，1) 
Xx = nodes(handles.above，1)， 
elSejif X < nodes(hand1es ,below，1) 
X = nodes(fhandles.below，1)， 
end 
end 
nodesthandles.node，;:) = [fx y]， 
handles = Setfield(handles ， handles,curve，nodes ) ; 
guidata(hobject，hand1les ) ， 
graph(hand1es ) ， 


function 1ce_WindowButtonUpFcn(hObject ， eventdata，hand1es) 
光 Terminate 0ngoing control point move or add operation，Clear 
乞 Coordinate text below plot and Update diSplay， 


update = Strcmpi(hand1les.updown， down ' ) ; 


handles,updown = :Up'， handles,node = 0 
guidata(hobject，hand1les ) ; 
if Update 


set(hand1es,input_text，'String' ，'“…) 
Set(handles,output_text，'String' ， 25 
render(handles ) ; 

end 


) 
) ; 


Jn general, figure callbacks are launched in Tesponse to interactions with a fig- 
Ure object or window 一 not an active uicontrol object. More specifically， 


。 jhe WindowButtonDownFcn is executed when a user clicks a moOuse but- 
ton with the cursor in a figure but not over an enabled uicontrol (e.g.,3 
pushbutton or popup menu). 

。 [pe WindowButtonMotionFcn is executed when a user moves a de- 
pressed mouse button within a fieure window. 

。 Jhe WindowButtonUpFcn is executed when a user releases a mouse but- 
ton, after having pressed the mouse button within a fgure but not Over an 
enabled uicontro1l. 


The purpose and behavior of ice figure callbacks are documented (via com- 
ments) in the code. We make the following general observations about the 
final implementations: 


二 Because the ice_WindowButtonDownFcnis called on all mouse button ciicks 
in the ice figure (except over an active graphlc object), the first job of the 
cajiback function is to see 让 the cursor is within ice's plot area (i.e., the ex- 
tent of the curve_axes object), If the cursor is outside this area, the monse 
Should be jgnored.The test forthis is performed by internal function cursor， 
whose listing was provided in the previous section. In curso mr, the statement 


P = get(th， "CurrentPoint ' ) ; 
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returns _ the current _ cursor coordinates. Variable h is Passed from 
ice WindowButtonDownFcn and originates 38 input argument hobject. 
In all figure callbacks, hObject is tpe handle of the figure requesting Ser- 
vice. Property 'CurrentPoint contains tne position of the cUISOT rela- 
tive to the figure as a two-element TOW VectorT [xy]. 


Since ice is designed to work with two- and three-button mice， 
ice WindowButtonDownFcn must determine whjich mouse button causes 
each callbacKk. As can be seen in the code, this js done with a Switch con- 
struct using the figure'Ss "SelectionType  PIOPEeItY Cases “normal '， 
extent' and 'alt' correspond to the left, middje, and Tight button 
clicks on three-button mice (or the left, shift-left, and control-left clicks of 
two-button micej, respectively, and are Used to trigger the add control 
point, move control point, and dejlete control point Operatlons，. 


The displayed ICE mapping function 18 updated (via internal function 
graph) each time a control point 1S modified, but the output figure, whose 
handle is stored in handles,output,is updated on 1a0Se DUOP reljeasey 
only. This is because the computation of the outpat image, which 18 Per- 
formed by internal function Prender, can be time-consuming. It InvVolves 
mapping separately the input image s three color components, remapPping 
each by the“all-component”curve, and convertinag the mapped compo- 
nents to the RGB color space for display. Note that without adequate pre- 
cautions，the mapping function's control points could be modihed 
inadvertently during this lengthy output mapping process， 

To prevent this, ice controls the interruptibility ofits various callbacks.A] 
MATLAB graphics objects have an Interruptiple PITOPerty that determines 
whether their callbacks can be interrupted.The default value of eveIy object S 
Interruptibl1e' property is "on ,which means that object callbacks can be 
interrupted.JIf switched to "off ,callbacks that occur during the execution of 
the now noninterruptible callback are either ipgnored (ie.,cancelled) OF Placed 
in an evemt guete for later processing. The disposition of the interrupting cal- 
back js determined by the 'BusyAction' Property of the object being inter- 
rupted. 于 BusyAction 18 cancel ， the callback is discarded; 证 “queue ， 
the caliback is processed after the noninterruptible calijback finlshes. 

The ice WindowButtonUpFcn functlon USes the _ mechanism just de- 
scribed to suspend temporarily (Le., during Output image computations) the 
user's ability to manipulate mapping function control points. TIbe Sequence 


set{thand1es.ICce， Interruptib1le' ， "off ) 
set(hand1les.ice， ' Pointer ， Watchn ) 
set{(handles.ice， 'Pointer ， arrow ) 1 
Set(hand1eSs .ICe， Interruptib1e'， on ) 1 


in internal function render sets the ice figure windowS ， TInterruptibl1e 
property to 'off' during the mapping of the output image and pseudo- and 
full-color bars.This prevents users from modifying mapPping functlon control 





下 2 


2 Set(hand1es .Ice， 


Points while a mapping is being Performed, Note also that the fgures 
'Pointer' Property is set to 'watch' to indicate visually that ice 1S busy 
and resetto 'arrow' when the output computation is completed. 


.2.4 Object Calljback Functions 


The final nine lines of the starting GUI M-file at the beginning of Section 也 .< 
are Object callback function stubs. 工 这 e the automatically generated figure ca 
backs of the Previous section, they are initially void of code. Fujiy developed 
versions of the functions follow. Note that each function Processes UseI lnteT- 
action with a different ice uicontrol object (Pushbutton, etc.) and is named 
by concatenating its Tag property with string '_Callback ' .For example, the 
callback function responsible for handling the Selection of the dsplayed map- 
ping function is named the component_popup_Callback. ft is called when the 
User activates (ie., clicks on) the popup sejector. Note also that Input argu- 
ment h0bject is the handle of the popup graphics object- 一 not the handle of 
the ice figure (as in the figure callbacks of the Previous section), ICE'S object 
callbacKks involve minimal code and are self-documenting. 


function component_popup_Callback(hobject，eventdata，handles ) 
% Accept coLor component selection，update Component Specific 
%% parameters on GUI，and draw the selected mapping function， 


C = get(hobject， Value ) | 

handJles,.cindex = Cj 

handles.curve = strcat('set' ，num2str(c) ) ; 
guidata(hobject，handles ) ; 

set(hand1les .smooth_checkbox， 'Value  ，hand1es.Smooth(Cch ) 1; 
set(hand1les,.s1ope_checkbox， 'Value' ，handles,SlLope(ci) ) 
set(hand1les.pdf checkbox， "Value ，hand1les.pdf(c) ) 


set(handles.cdf_checkbox，'Value'，handles.cdf(c) ); 
2 graph(hand1les ) ; 


function smooth_checkbox_Callback(hobject，eventdata，hand]es ) 
% Accept smoothing parameter for currentlLy Selected Color 


:5% component and redraw mapping function， 
冯 If get(hobject， 


Value ) 
handljes.smooth(handles.cindex) = 1 

nodes = getfield(handles，handles.curve ) ; 
nodes = Spreadout (nodes ) ; 


-end 


handles = setfield(handLles，handles.curve，nodes ) ; 
81Se 
hand1les.smoothf(handles.cindex) = 0j 
guidatalhobject，handlies) ; 
fsSet(handles,ice， ' Pointer ， Watch ) ; 
graphl(handJes ) ; render(handJIes) ; 
'Pointer ， arrow ) 
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function reset_pushbutton_Caljlback(hobject，eventdata，handles ) 
% Init all display parameters for currentJly Selected Color 
汪 component ，make map 1:1，and redraw 1t， 


handles = Setfleld(handles，Phandles.curve， [0 0; 1 1]) 

C = handJjes.cindex; 

handjljes ,smooth(c) = 0; set(handles .Smooth_checkbox， Value ，0) 
handles.SLope(c) = 0; set(handles.SIo0pe_checkbox， Value ，0) |; 
handles.pdf(c) = 0 set(handles.pdf_checkbox， Value ，0) 
handjes.cdf(c) = 0; set(handles.cdf checkbox， Value ，0) ; 
guidatathobject，hand1les ) ; 


心 


set{hand1les.ice， :Pointer ， Watch ) ; 
graph(handJes ) ; render(handJles) ; 
set(handlies.ice， Pointer ， arrow ' ) 


function Slope_checkbox_Callback(hobject，eventdata，hand1es ) 
%5 Accept Siope Clamp for currentJly Selected color Component and 
% draw functIon if Smoothing 1S on， 


if get(hobject， ' Value  ) 


handles.sSLope(handles.clndex) = 1 
elSe 

handles.Slopel(handles.cindex) = 0; 
end 


guidata(hobject，handles ) ; 

if handles,smooth(hand1les.cindex ) 
set(handles.lice， 'Pointer ， "Watch ' ) ; 
graph(hand1l1es ) ; render(hand]les ) ; 
Setthandles.ice， Pointer ， "arrow' ) ; 
end 


funct1Ion resetall_pushbutton_Callback(hobject，eventdata，hand1Les ) 
s JInit display parameters for colonr components，make al1l maps 13:1， 
%5 and redraw displLay， 


for cC = 1:4 
hand1les.Smoothf(c) = 0; hand1es.S1Lopel(c) = 0 
handles.pdf(c) = 0 handles.cdf(c) = 0; 


handles = Setfieldf(handles，[f'set' num2str(c)]，[0 0:;: 1 1]); 
end 
set(thandJles.Smooth_checkbox， 'Value ，0) ; 
set(hand1les.S10pe_checkbox， :Value' ，0); 
set(handles.pdf_checkbox， ' Value'，0); 
set (handles.cdf_ checkbox， "Value ，0) ; 
guidatal(hobject，hand]les ) ; 
set(handjes.ice， 'Pointer ， "watch' ) ; 
graphl(handles) ; render(handles) ; 
Set(handles.1Ice， ' Pointer  ， arrow ) ; 





AL 省 
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2 guidata(hobject，hand1les ) ; 


hand1les.imagemap = get(hobject， 'Value ') 


和 sc 人 raisece airav RISE 
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function pdf_checkbox_Callback(hobject， eventdata，hand1les ) 

多 Accept PDF (probability density function or histogram) display 
多 parameter for currentLy Selected color component and redraw 

5 mapping function ij Smoothing is on， TI Set，Cclear CDF display， 


if get(hobject，'Value') 
handles.pdf(handles.cindex) = 1 
set(hand1les.cdf_checkbox，'Value' ，0) ; 
handles.cdf(handles.cindex) = 0; 


el1Se 
handJles.pdf(handles.cindex) = 0; 
end 
guidata(hobject，hand1es ) ; graphk(handjles) ; 


function cdf_checkbox_Callback(hObject， eventdata，hand1es ) 

和 Accept CDF (cumulative distribution function) display parameter 
% for Selected Color component and redraw mapping function if 

5 Smoothing is on， If set，CclLlear CDF display ， 


if get(hobject，'Value ') 
hand1es.cdf(handlies.cindex) = 让 
set(handles.pdf_checkbox，'Value'，0); 


handles.pdf(handles.cindex) = 0 ; 
elLSe 
handles.cdf(handles.cindex) = 0 | 


end 
graph(hand1les ) ; 


ge- 0 % 

2 function mapbar_checkbox_Callback(hObject， eventdata，hand1les) 

”名 Accept changes to bar map enable state and redraw bars， 

hand1es,barmap = get(hobject，'Value' ); 

二 guidata(hobject， handJIes ) ; render(handles ) ; 
和 和 


function mapimage_checkbox_Callback(hobject， eventdata，hand1les ) 


5 Accept changes to the image mapb state and redraw image ， 


? 


guidata(hobject，hand1les) render(handles ) ; 





PreVDIeZD 

This appendix contains a listing of all the M-functions that are not listed earl- 
erin the book.The functions are organized alphabetically. TIhe first two jines of 
each function are typed in bold letters as a visual cue to facilitate finding the 
function and reading its Summary description， 


A 


function 下 = adpmedian(g，Smax ) 

%ADPMEDIAN Perform adaptive median filtering . 

% F = ADPMEDIAN(G，SMAX) performs adaptive median filtering of 

% image G， The median filter Starts at Size 3-by-3 and lterates up 
% to size SMAX-by-SMAX. SMAX must be an 0dd integer greater than 1 . 


% SMAX must be an 0odd，positive integer greater than 二， 

if (Smax <= 1) | (Smax/2 == round(Smax/2)) | (Smax ~= round(Smax ) ) 
errorf'SMAX must pe an 0odd integer > 1，) 

engd 

[M，N] = Size(g) 


%5 Initial Setup， 

f = g| 

人 人 二 

alreadyProcessed = false(Size(g)); 


5 Begin filtering， 
for K = 3:2:Smax 


zmin = ordfilt2(g，t，ones(k，Kk)， 'Symmetric ) ; 
zmax = Ordfilt2(g，K * kK，ones(k，K)， SymmetrIc ) 
zmed = medfilt2(g，[Kk Kk]， 'Symmetric ) ; 





0 
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proceSsSUSsingLevelB = {zmed > zmin) & (zmnax > zmed) & ，,， 
_ -alreadyProcessed 

ZzZB= (g> znin) eg (znax > g); 

outputZxy = procesSsUSsingLevelB & ZzB; 

outputZmed = proceSsSUSIngLeve1lB & ~zB; 

ff(outputZxy) = g(outputZxy) ; 

f(outputZzmed) = zmed(outputzmed ) ; 


alreadyProcessed = alreadyProcessed | processUsingLeveJB; 
if all(alreadyProcessed(:) ) 
breakK ; 
end 
end 


%5 Output zmed for any remaining Unprocessed pixelSs， Note that thils 
% zmed Was Computed Using a window of Size Smax-by-Smax，wWhich 1S 
% the final value of k in the 100p， 

f(~aLlreadyProcessed) = zmed(-~alreadyProcessed ) ; 


function rcC_new = bound2eight(rc) 

%BOUND2EIGHT Convert 4-connected boundary to 8-connected boundary ， 
RC_NEW = BOUND2EIGHT(RC) converts a four-connected boundary to an 
eight-Cconnected boundary， RC is 8& P-by-2 matrix，each row 0f 
which contains the row and colLumn coordinates of a boundary 
pixel， RC must be a closed boundary; in other words，the 1ast 
row of RC must equal the first row of RCR， BOUND2EIGHT removes 
boundary pixels that are neceSSary for four-connectedness put not 
necessary for eight-connectedness， RC_NEW 1S a Q-by-2 matrix， 
Where Q <= P. 


op op dp op de opoop 


if ~iSsempty(rc) & -isequal(rc(1，:)，rcfend，:)) 
error( EXpected input boundary to be closed，) ; 
end 


If Sizelrc，1) <= 3 
% Degenerate CasSe 
PFC_new = PC 
Feturn 

end 


% Remove 1ast row，which equals the first FowW， 
rc_newWw = rc(1:end 一 1，:) 


Remove the midd1e pixel in four-Connected right-angle turns.， We 
can do this jin a vectorized fasnhion，but We can't do if all at 
once，Similar to the Way the “thin' algorithm works in bwmorph， 
We'11 remove first the midd1e pixels in four-connected turns Where 
the row and column are both even; then the middle pixels in all 
the _ remaining four-connected turns Where the row is even and the 
coLumn is 0dd;， then again Where the row is 0dd and the CoJUumn 1S 
even;， and finally where both the row and column are 0dd， 


Ge co opg oo o 归 op oe 
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remove_J]ocations = Compute_Premove_jocatlons(Prc_newj) 

field1 = remove ]ocations & (Penm(rc_new(:，1)，2) == 0) & .,， 
(Prem(rc_new(:，2)，2) == 0) |; 

rc_new(field1，:) = 【[]; 


remove_ Jocations = Compute_remove_j1locations(rc_new) ; 

field2 = remove locations & (Prem(rc_new(:，1)，2) == 0) & ... 
(Prem(rc_new(:，2)，2) == 1]); 

rc_new(field2，:) = []; 


remove_jocations = Compute_remove_1ocations(rc_new) ; 

field3 = remove 10cations & {PFem(rc_new(:，1)，2) == 1) & .,. 
(rem(rc_new(:，2)，2) == 0) | 

rc_new(field3，:) = 门 ; 


remove_1ocations = Compute_remove 10ocations(rc_new) ; 

field4 = remove locations & (rem(rc_new(:，1)，2) == 1) & ..,，. 
(rem(trc_new(:，2)，2) == 1)); 

rcC_new(field4，:) = []; 


% Make the output boundary closed again， 
rc_new = [rc_newi rc_new(1，:)]; 


function remove = Compute_remove_1Locationstrc) 


% Circular diff ， 
d= [rcl2:end，:);， rcl1，:)] -Prci 
和 5 


Dot product of each row of d with the Subseduent row of d， 
和 5 performned in Circular fashion， 
dl = [df(2:end，:);， d(1，:)]; 
dotprod = Sum(d .* d1，2); 


5%5 上 Oocations of N，S，E，and W transSitions followed by 
% aa right-angle turn， 
Penmove = -all(d，2) & (qdqotprod == 0) | 


5%5 But We realjy Want to remove the middle pixel of the tuUPn. 
remove = fremove(end，:) ;remove(1:end 一 1，1:)]; 


if ~any(remove ) 
done = ]; 
e]1Se 
idx = find(remove ) ; 
rclidx(1)，:) = [) 
end 


function rc_new = bound2four(rc) 

55B0OUND2FOUR Convert 8-connected boundary to 4-connected boundary， 
RUC_NEW = BOUND2FOUR(RC) converts an eight-connected boundary to a 
four-connected boundary，， RC is a P-by-2 matrix，each Fow of 
Which contains the row and column coordinates of a boundary 
piIxel， 8BOUND2F0OUR inserts new boundary Pixels Wherever there is 
a diagonal connection， 


oo co oo oo op 


if Size(rc，1) > 1 
% Phase 1; remove diagonal turns，one at a time until they are al1l gone， 





Appendix( 副 M-Funtions 


done = 0; 
rcl = [rclend -~- 1，:) rc] 
whiJe ~done 
d= diff(rcf，1)，; 
diagonal_locations = all{d，2); 
double_diagonals = diagonal _ locations(1:end - 1) & ,,， 
(diff(diagonal_ locations，1) == 0)， 
double_diagonal_idx = find(double diagonals); 
turns = any(d(double diagonal idx，:) -= ... 
d(double_diagonal_ idx + 1，:)，2); 
turns_lidx = doubl1e_ diagonal_idx(turns) ; 
if isempty(turns_idx) 
done = 1 ; 
el1Se 
first turn = turns_idx(1); 
rcf(first_turn + 1，:) = (rcf(first turn，:) + ，,， 
FESL TREE 直 22 和 大计 2 
if first_ turn == 
二 rci(end，:) = Pc1(2，:); 
癌 end 
end 
end 
rclf = rc1(2:end，:); 


Oo 
相 


end 
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% Phase 2:; insert extra piXxelSs Where there are diagonal connections . 


rowdiffT = difflrc1(:，1))， 
coldiff = diff(rcf(:，2) ) | 


diagonal_locations = Prowdiff & coldiff; 

num_ 01d_pixels = Sizef(rc1，1)， 

num_new_plixelSs = num_0ld_pixels + Sum(diagonal_locations)， 
rc_new = zeroSs(num_new_pixelSs，2)， 


各 Insert the original values into the proper locations in the new RC 
乞 站 iatrix， 

idx = (1:num old _ pixels)' + [0; cumsum(diagonal_1Jocations) ] ; 
ro_new(idx， :) = Prcfi 


和 Compute the new pixelS to be inserted. 

new_pixel_offsets = [0 1; -1 0; 10; 0 -1]; 

offset_ codes = 2 * (1 一 (coldiff(diagonal_ locations) + 1)/2) + ， 
《2 - (rowdiff(diagonal locations) + 1)/2)， 

new_pixels = rcf(diagonal _ locations，:) + ,,. 
newW_Plxel_offsets(offset codes，: ); 


5 Where do the new pixels g0? 

insertion 1ocations = zeros(num_new_pixels，1)， 
insertion_ locations(idx) = 1 
Insertion_locations = -~insertion_ 1ocations， 


人 下 人 RPR 
汪 人 宙 本 人 0 人 


% Insert the new pixels， 
rc_new(insertion_locations，:) = new_pixels， 


和 
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function B = bound2im(b，M，N，Xx0，yY0) 

%BOUND2IM Converts a boundary to an image ， 

B = BOUND2IM{b) converts b，an np-by-2 or 2-by-np array 
representing the integer coordinates of a boundary， Lnto a binary 
image with 1s in the 10cations defined by the coordinates :1n b 


and 0S$ el1Sewhere ， 


B = BOUND2IM(b，M，N) places the boundary approximately Centered 
in an M-by-N image. If any part of the boundary is outside the 
M-by-N rectangle，an error 1S issued， 


8B = BOUND2IM(b，M，N，X0，Y0) places the boundary in an image of 
sjze M-by-N，with the topmost boundary point 10cated at X0 and 
the 1Leftmost point located at Y0， If the shifted boundary 1S 
outside the M-by-N rectangle，an error is isSsued， X0 and X0 must 
be positive LiLntegers . 


oo deoceooop oocoece co oo oo oo oo 


[np，nc]l = Size(b); 

If np < nc 
b = b';5%To convert to Size np-by-2. 
[np，nc] = Sizelb) ; 

end 


Make Sure the _ coordinates are lintegers . 
= round(b(:，1)); 
= round(lb(:，2) ) ; 


Set Up the default Size parameters ， 
=X 一 min(Xxh) 二 了 

YE 

= falsefmax(X)，max(y) ); 

= max(x) 一 minfXxh) + 十 ; 

max(y) 一 min(ty) + 1; 


CO SR XXX oo 到 o 


LIf nargin == 1 
% Use the preceding default Values， 
elS$elf nargin == 3 
i 计 CC>MHM|lD>N 
error( 'The boundary is outside the M-by-N region，) 
end 
% The image size Will be M-by-N. Set up the parameters for this， 
B = falselM，N) ; 
Distribute extra rows approx，even between top and bottom， 
R = Proundf(M 一 C)72) 1; 
C = round((N - D)/2); % The Same for coLumns， 
= X+NR;5% Offset the boundary to new posit1on， 
yY =y+NCi 
elSeif nargin == 
if x0O<01|yos<0 
error('x0 and y0 must be positive integers，) 


N 
N 
X 


end 
X=X+round(xo) 一 1 
yY=yY+round(y0) -1T; 
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C=C+Xxo--1; 
D=D+yo--1 
IfC>M1|ID>AN 
error( The Shifted boundary is outsiqde the M-by-N region，.') 
end 
B = falsel(MH，N) |; 
eJ1Se 
error( Incorrect number of inputs .') 
end 


1 让 


B(sub2ind(size(B)，x，y)) = true， 


function B = boundaries(BW，conn，dir) 

%BOUNDARIES Trace object boundaries.， 

B = BOUNDARIES(BW) traces the exterior boundaries of objects in 
the binary image BENW.、B is a P-by-1 cell array，where P is the 
number of objects in the image，Each cel1 contains a Q-by-2 
matrix，each row of Which contains the row and Column coordinates 
of a boundary pixel.， Qis the number of boundary pixe1ls for the 
Corresponding object， 0bject boundaries are traced in the 
CLOCKkWw1se direction， 


8B = BOUNDARIES(BW，CONN) specifies the connectivity to Use When 
tracing boundaries， CONN may be either 8 or 4， The default 
Value for CONN liS 8， 


B = BOUNDARIES{(BW，CONN，DIR) Specifies the direction Used for 
tracing boundaries， DIR Should be either 'cw' (trace boundaries 
CLOCKWi1Se) Or "ccw' (trace boundaries Counterclockwise)， If DIR 
1S omitted BOUNDARIES traces in the CLockWise directIion， 


oopooesocohoeoooedo oopoep 


if nargin < 3 


diPr = CCwW 
end 
if nargin < 2 
conn = 8; 
end 


L = bwlabel(BW，conn) ; 


%5 The number of 0bjects is the maximum value of L. Initialize the 
%5 Cell array B So that each cell initially contains a 0-by-2 matrix， 
numobjects = max(L(:) )， 
if numobjects > 0 

B = {zeroS(0，2)1}; 


B = repmat(B，num0bjects，1); 
e]1Se 

8 = [ji 
end 


s Pad 1abel matrix With zeroSs.， This lets US Write the 

s5 boundary-fo1L1owing 100p without worrying about going off the edge 
:50f the image， 

Lp = padarray(L，[1 1]，0，'both' ); 
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etighbors ， 
Size(Lp，1); 
if conn == 8 
%5 0Order 1SNNE ESE SSWWNW. 
offsets = [-1;， M 一 1 MT+ 1 1 -+T 惟 ， 民 -1; 
el1Se 
%5 Order 1S NE SWW. 
offsets = [-1，M，1， 二 ] ; 


% Compute the 1inear jindexing offsets to take US from a pixel to itSs 
n 


end 
% next_ Search direction lut is a LOokup tab1le， Given the direction 
% from pixel K to pixel k+{t，WwWhat is the direction to Start With When 
% examining the neighborhood of Pixel kt+1? 
计 f Conn == 
next Search direction lut = [882244686] 
8e]Se 
next Search direction lut = [4 1 2 3]; 
end 


% next_direction_ lut is a 1ooKup tab1e， Given that We just Looked at 
s%5 neighbor in a given direction，which neighbor do we Jook at next? 
if conn == 8 

next _ direction Jut = [23456781]; 


el1Se 
next_ direction lut= [2341]; 
end 
% Values Used for marking the Starting and boundary piIxels， 
START = 一 | ; 
BOUNDARY = 一 2; 


% Initialize Scratch Space in which to record the boundary Pixels 3aS 
% Wel1 as folLow +the boundary . 
scratch = zeros(100，1) ; 


% Find candidate Starting Locations for boundaries ， 
[Err，ccl = find((Lp(2:end-1，:) > 0) & (Lp(1:end-2，:) == 0))); 
人 


for k = 1:1Length(rr) 

PP= Prk); 

C = Cocfk); 

计 (Lp(rc) > 0) & (Lptr 一 1，c) == 0) & 1sempty(B{Lp(r，c)}) 
% We' ve found the Start of the next boundary， Compute tS 
% Linear offset，record which boundary it isS，nmark jit，and 
多 Linitialize the counter for the number of boundary pixels， 
1dx = (C-1)*Size(lLp，1) + 下; 
which = LpP(IdX) 


SCcratch(1) = 1Ldx; 

Lpl(idxy = START ; 

numPixels = 1; 

CUrrentPixel = IdxX; 

initial_ departure_ direction = []) 
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done = 0; 


next_Search_direction = 2; 
While -~done 


上 ind the next boundary pixel， 
direction = next_Search_direction; 
found_next_pixel = 0， 
for K = 1:1ength(offsets) 
neighbor = CurrentPixel + offsets(direction) 
if Lp(neighbor) -= 0 
和 Found the next boundary pixe1， 


if (Lp(currentPixe1) == START) & ,，, . 
isSempty(initial departure_direction) 
各 We are making the initial departure fron 
多 the Starting pixel， 
initial_departure_direction = direction; 


elSeif (Lp(currentPixel) == START) & .,， 
(initial_departure_ direction == direction) 
5 We are about to retrace ouUr path ， 
生 That means We're done， 
done = ] ; 
found_next_pixel = 1， 
break; 
end 


e 
有 


% Take the next Step along the boundary， 
next_Ssearch direction = ，. 
next_search_direction_lut(direction】 
found_next_pixel = 1 
numPIXe1S = numPixelS + 十 ; 
if numPixe1Ss > Size(Scratch，1) 
# Double the Scratch space， 
scratch(2*size(SCratch，1)) = 0 
end 
Scratch(numPixelSs) = neighbor ; 
if Lp(neighbor) ~:= START 


tp(neighbor) = BOUNDARY， 
end 


和 
和 


currentFixel = nejighbor; 
break ; 
end 


人 = next_direction_ lut(direction) ， 
en 
if ~found_next_pixel 
各 If there is no next neighbor，the object must just 
5 have a Sing1le pixel， 
numPixels = 2; 
scratch(2) = Scratch(1); 
done = 1; 
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end 
end 
% Convert linear indices to row-column coordinates and Save 
% in the output cell array， 
[row，colL] = ind2sub(sSize(Lp)，Scratch(1:numPixeJS)) 
B{fwhich}y = [row -1，col 一 1; 
end 
end 


和 5 
%5 0Ver，add zeros to extend xmax and vmax 
% number of grid lines are obtained . ee 
% 3S1Z86 needed in the Xx-direction: 

L = gridsep + 1 
nn = CeiJ(Xxmax/L) ; 
TJ= (n 一 1)x*Lr+ 1 
反 





if strcmp(dir， ccw ) 
for k = 1:1ength(B) 下 工 S the number of zer0s that Would be needed to have grid 
B{k} = B{k}(end:-13:1，:); ， 2 In b being 1eft over， 
本 if Zx == 
ZX = 0; 
end 


function [fs，SuU] = bsubsanmp(b，gridsep) 

%BSUBSAMP Subsample a boundary . 

[S，SU] = BSUBSAMP(B，GRIDSEP) subsampjes the boundary 昌 by 
assigning each of its points to the grid node to Which 工 世 工 5 
closest， The grigd is specified by GRIDSEP，wWhich is the 
separation in pixelLs between the grid Lines. For examp1e，jf 
GRIDSEP = 2，there are two pixels in between grid lines，So，for 


先 Number of grid lines in the x-direction，With L pixel SpaceSs 
% in between each grid 1Line， 
6GLX = (Xmax + ZX 一 1)/AL+ 1 


% And for the y-direction: 
0 = Ceil(ynmax/L) ; 
= (nn 一 4)*Ld+T; 


instance，the grid points in the first row Would be at (1 1)， 三 
(1,4)，(1;,6)，...，and similarly in the yY direction，The Value 7 3 有 本 
of GRIDSEP must be an even jinteger. The boundary is specified by 2 2 

| 


a set of _ coordinates in the form of an np-by-2 array， It 1iS 


assumed that the boundary is one pixel thick， 
GLy = (ynmax + ZyY 一 1)/ 人 LT+ 1 


s% Form vectors of Xx and y grid locations， 

I = 1:GLX; 

人 和 Vector of grid line 10cations intersecting X-axis， 
| x{(TI) = gridsep*I + {I- gridsep)， 

= 13:GLy; 

攻 和 Vector of grid line locations intersecting y-axiS， 
加 Y(J) = gridsep*yJ + (dd - gridsep); 


Output S is the subsampled boundary，Output SU 1S normalized SO 
that the grid separation is Unity， This is useful for obtainlng 
the Freeman chain code of the subsampled boundary， 


ddd ooe co ce co on oo 


% Check input， 

[np，nc]j = Sizel(b) ; 

If np < nc 
error('B must be of Size np-by-2，) 

end 

if gridsep/2 -= round(gridsepy/2) 
error('GRIDSEP must be an even integer，) 

end 


Compute both components of the cityb1lock distance between each 
二 element of b and all the grid-1line intersections， Assign each 

世 和 point to the grid location for which each comp of the Cityb1Lock 

站 | as 1eSS than gridsep/2，Because gridsep is an even 

# 6 lnteger，these asSignments are Unigque， | 

9 unlique，Note the Use of meshgrid to 
DIST = gridsep/12; 

，[xXG，YG] = meshgrid(X，Y) ; 


9 


Some boundary tracing programs，Such as boundaries,m end with 
% the beginning，resulLting in a Sequence in which the coordinates 

of the first and last points are the Same. If this is the Case 
in b，eliminate the Last point. 





if isequallb(1，:)，b(np，:)) 和 Q = 1 
np=np 一 1 for k=1:np 
b =b1inp，2) 人 [I] = find(abs(XG - b(k，1)) <= DIST & abs(YG - b(k，2)) <= ,.. 
end 和 DTST) 
生 | = ， 
% Find the max X and yY Spanned by the boundary， 起 0 
ord = Kk*ones{(IL，1); % To keep track of order of input coordinates 


Xmax = maxtfb(:，]) ) 
ymax = max(b(:，2) ) ; 
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K=Qr+IL- 1 
dl1(Q:K，:) = Cat(2，X(I)，ord) ; 
d2(Q:K，:) = Cat(2，Y(J)，ord) | 
Q=K+T; 

end 


d is the set of points asSsigned to the new grid with line 
Separation of gridsep. Note that it is formed as d=(d2,d1) to 
compensate for the coordinate transposition inherent in Using 


meshgrid (See Chapter 2) . 
= Cat(2，d2(:，1)，d1); % The second column of d1 is ord， 


Sort the points Using the values in ord，which is the 1ast colL in 


C 


fliplr(d)， 点 5S0 the ast Column becomes first， 
Sortrows(d ) ; 
f]Jiplr(d);，%% FLIip back. 


ElLiminate duplLicate rows in the first two ComponentSs 0f 
d to create the Output. The CwW or ccw order MUST be preserved. 


二 .oo 口 aoo ce do de 
Il il LN 


ooe ce 


S=d(:，1:2); 

[S，m nl] = unique{(S， PowS ' ) ; 

% Function unique Sorts the data--Restore to original order 
%5 by UsSing the contents of 站。 

S= [Sn]) 

s = flipJLr(Ss); 

Ss = SortrowSs(S) ; 

s$ = fliplr(Ss) ; 

S=S(:，1:2); 

5 Scale to Unit grid So that can Use directLy to obtain Freeman 
5% chain code The Shape does not change， 


Su = round(s,./gridsep) + 了 |; 


《 


function image = changeclass(Cclass，Vvarargin) 
5CHANGECLASS changes the storage class of an image， 
%5 I2 = CHANGECLASS(CLASS，I) ; 

名 “RGB2 = CHANGECLASS(CLASS，RGB ) ; 

%5 8BW2 = CHANGECLASS(CLASS，BW) ; 

汪 X2 = CHANGECLASS(CLASS，X， "indexed ' ) ; 


Copyright 1993-2002 The MathWorks，Inc. Used with permission， 
$Revision: 1.2 $ $Date: 2003/102119 22:09:58 4 


SWitch class 
Case “Uint8， 

image = im2uint8f(vararginf :}); 
case Unt16: 

image = Im2uint16(vararginf 3: ) 


Ge 


Ge 
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CaSe doub1e， 

image = im2double(varargin{f :})， 
otherwise 

error( Unsupported IPT data class ) ; 
end 


function [VG，A，pPpG]= cojlorgrad(f，T) 

%C0LORGRAD Computes the vector gradient of an RGB image, 
[Ve，VA，PPG] = COLORGRAD(F，T) computes the vector gradient，VG 
and corresponding angle array，VA，(in radians) of RGB image 

F， It also computes PPQ，the per-plane composite gradient 
obtained by Summing the 2-D gradients of the individual color 
planes， Input T is a threshold in the range [0，1]， If it is 
LIncJuded in the argument 1List，the Values of VG and PPG are 
thresholded by letting VG(xyY) = 0 for values <= T and VG(Xx,Y) = 
VG(xy) otherwise，Similar Comments appjy to PPG.， If T is not 
included in the argument list then T is Set to 0，Both output 
gradients are scaled to the range [0，1]. 


if (ndims(f) -= 3) | (size(f，3) -- 3) 


error( Input inmage must be RGB ， ) ; 
end 


了 


ooopcpoeode oo yo 


5 Compute the x and y derivatives of the three comnponent imageS 
% USing Sobel operators， 
sh = fspecial( 'sobel'); 


sSV = sh'; 

PRXx = imfilter(double(f(:， :11)，sh，' replicate')， 
By = imfilter(double(f(:， :，1))，SV， replicate' )， 
6Gx = imfilter(double(f(:， :，2))，Sh， replicate ' ) ， 
Gy = imfilter(double(f(:，:， 2))，SV， 'replicate' ) ; 
Bx = imfilter(double(f(f:， ， 3 引 )，sh，' replicate')， 
By = lmfilter(doublje(f(:，;， 3))，SV， 'Preplicate ')， 


委 Compute the parameters of the vector gradient， 


gxX = Rx,^2 + Gx.^2 + BXx.^2，; 
gyy = Ry. 2 + Gy.^2 + By.^2， 
gxXy = RX.*Ry + GX.x*Gy + BXx,.*By; 


A= 0.5*(atan(2*gxy./(gxx - gyy + eps)))， 
61 = 0.5*((gxx + gyy) + (gxx - gyYyY) ”coS(2*A) + 2*gxy,*sin(2xA) ) 


5 Now repeat for angle + pi/2. Then Select the maximum at each Point . 


A 二 风 二 /2， 

62 = 0.5*{(gxx + gyy) + (gxx 一 gyy).*cos(2*A) + 2*gxy,x*sin(2xA) ); 
6G1 = 61,^0,5; 

62 = 62.^0.5， 


% Form VG by picking the maximum at each (xy) and then Scale 
% to the range [0， 全 [和 

VG = mat2gray(max{(G1 ， G2 1) ) ; 

5 Compute the per-plane gradients， 

RG = Sqrt(RXx.^2 + Ry. “2) ; 

6G = Sqrt(Gx.^2 + Gy. “2) |; 
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SWitch method 
case “euUClidean' 

名 Compute the Euclidean distance between all1 rows of X and m，S$ee 

5 Section 12.2 of DIPUM for an expJlanation of the fol1]lowing 

5 expression. D(i) is the Euclidean distance betwWeen Vector X(i，, :) 

%5 and Vector 由 ， 

P = Length( 了 ) ; 

D = Sqrt(Sum(abs(f -- repmat{(m，p，1)).“2，2)) 
Case “nahalanobis' 

4 = varargin{f5] ; 

D = mahalanobis (ff，C， 由 ) 
otherwise 

error( Unknown segmentation method， ) 
end 


8G6 = Sqrt(Bx.^2 + By. 2) ; 

委 Form the Composite by adding the individual results and 
%5 SCale to [0，1]， 

PPG = mat2gray(RG + 6G + BG) ; 


和 5 Threshold the result， 
if nargin == 2 
VGQ = (VG > T) ,<*VG; 
PP RPRG 光 本 YPRG: 
engd 


外 





function 工 = Co0lLorseg(varargin) 

%C0LORSEG performs Segmentation of a Color image. 
5 = CULORSEG( EUCLIDEAN'  ，F，T 了 ，M) performs Segmentation of color 
image F USing a Euclidean measure of Similarity. M is a 1-by-3 
Vector representing the average CoOLor used for Segmentation (this 
is the center of the Sphere in Fig，6,26 of DIPUM). T is the 
threshold against which the distances are Compared ， 


3 


加 D is a Vector of Size MN-by-1T containing the distance computations 
名 了 rom al the coLor pixels to vector m. Find the distances <= T， 
J = find(D <= T); 


s Set the Values of I(J) to 1. These are the segmented 
点 COJ0F pixelSs， 
T(JU) = 1 


% Reshape I jinto an M-by-N image . 
T = reshape(I，M，N) ; 


和 

拘 

站 

5 9 = CULORSEG( MAHALANOBIS' ，F，T，M，C) performs segmentation of 
5 Color inage F_ using the Mahalanobis distance as a measure of 

汪 Slimilarity. C is the 3-by-3 covariance matrix of the Sample color 
5%5 Vectors of the class of interest. See function covmatrix for the 
乞 Computation of C and M， 

和 

抑 


3 is the Segmented image (a binary matrix) in Which 0Ss denote the function c = connectpoly(x， y) 


background， 5C0ONNECTPOLY Connects vertices of a polygon， 

# (CC = CONNECTPOLY(X，Y) connects the pointS with coordinates given 

Prelinminaries， 汪汪 5 dinxXxandYwith straight Lines，These polints are assumed to be a 

5 hecall that varargin is a cell array， 5 S$equence of polygon vertices organized in the clockwise or 

f = varargln{f2]i 区 ”Counterclockwise direction，The Output，C，is the set of points 

下 (ndins (ff) -= 3) | (Size(fT，3) | 5 along the boundary of the polygon in the form of an nr-by-2 
errort Input image must be RGB ) 洗 Coorqinate sequence in the same direction as the input， The last 

end | 5 point in the sequence is equal to the first， 

MH = Size(f，1);N= Sizeff，2); 下 

% Convert f to vector fornmat using function inmstack2vectors ， | V= [x(:)，Y(:)] 

[f，L] = imstack2vectors(f); 5% Close polygon， 

f = double(f); 和 评 -isequal(v(end，:)，v(1， : ) ) 

多 Initialize I as a colLumn Vector， It Will be reshaped later 二 v(end + 1，:) = v(1，?)， 

% into an image. 


end 


工 = zeroS(M*N，1T); 8 ， 
T = vararginf3}; 5 Connect Vertices， 
m = vararginf4} ， 和 SegmentSs = CeJl1(1，1length(v) - 1); 
月 = mi(:) 55WMake sure that m is a row vector， for Is 2:length(V) 
0 人 = intline(vT 一 11) v(T TvI-1 2)，v(T，2)); 
if lengthfvarargin) == 机 Segmentsf{fI 一 1}) = [X，y]; 
method = "euclidean |; end 


elSseif Length(varargin) == 
method = 'mahalanobis ' ; 
e1LSe 
error( Wrong number of jinputSs。 ); 四 D 
end -function 5 = diameter(L) 
9DIAMETER Measure diameter and related properties of inmage regions， 


它 


5 9 = DIAMETER(L) computes the diameter，the major axis endpointSs， 


cc = Cat(1，Segments{ :})， 
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the minor axis endpointSs，and the basic rectangle of each Labeled 
region in the label matrix L. Positive integer 6jementS of 上 
correspond to different regions，For examp1le，the Set of 61LementS 
of L equal to 1 corresponds to region 1; the Set of 8ejlementSs of | 
equal to 2 corresponds to region 2; and SO0 on， 3S is a Stpructure 
array of Length max(L(:))， The fields of the StPucture array 
Include : 


Diameter 
MajorAxis 
MinorAXiS 
8asSicRectang1e 


The Diameter field，a Scalar，iSs the maximum distance between any 
two pixels in the corresponding region， 


The MajorAxis field is a 2-by-2 matrix， The Pows contain the Fow 
and column _ coordinates for the endpoints of the major axis of the 
Corresponding region， 


The MinorAxis field is a 2-by-2 matfrix， The rows Contailn the Fow 
and colLlumn coordinates for the endpoints of the minor axis of the 
Corresponding region， 


The BasicRectangle field is a 4-by-2 matrix， Each row contains 
the row and Column coordinates of a corner of +the 
region-enc1losing rectangle defined by the majoPr and minor axesS ， 


For more information about these measSurementS，S6e6e Section 11.2 .1 
of Digital Image Processing，by Gonzalez and Woods，2nd edition， 
Prentice Hal1l， 


op ce ooceoeoaceceeceooeceooo ecopoocooocpdeoce ce de op oo 


s = regionprops(L，{f Image ， BoundingBox }) ; 


for k = 1:length(Ss) 
[s(K) .Diameter，S(k).MajorAxis，perim FF，perim C] = ,,， 
Compute _ diameter(S(K) ) 
[s(k).8asicRectangle，Ss(k).MinorAxlIs] = ,,， 
Compute basic_rectangjle(sS(k)，perim Fr，perinm cy) ; 


function [d，majoraxis，r，c]j = Compute diameter(Ss) 

%  [D，NMAJORAXIS，R，C] = COMPUTE_DIAMETER(S) computes the diameter 
% and major axis for the region represented by the Structure 3，S 
% must contain the fields Image and BoundingBox .COMPUTE DIAMETER 
% 8lso preturns the row and column coordinates (R and C) of the 

各 perimeter Pixels of S$,Image ， 

站 


Compute row and coiumn coordinates of perimeter piIXxejls， 
IFr，c] = find(bwperim(s,Image)) 
FF=『(:) 


o op op ooooe de oo 
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C=c()i 
[rp，cp] = prune_pixel_ list(F，C)， 


num_pixelSs = length(Prp)， 
SWitch num _ pixelSs 


Case 0 

d = ~Inf; 

majoraxis = ones(2，2); 
Case | 

d = 0; 

majoraxis = [rp cp; rp cp]， 
Case 2 


d= (rp(2) -rp(1T))*2 + (cp(2) - cp(1T)) 2; 
ma]joraxis = [rp cp]; 
otherwise 
5 Generate alI1 Combinations of 1:num plIxels taken two at at tinme， 
5 Method suggested by Peter Acklan， 
站 加 人 2 (下 过 findttril(ones(num_pixe1s) ， 一 1) ) ; 
rr = Prp(Idx); 
cc = CPp(idx) ; 
dist_ Squared = (Pr(:，1) - 2 
(cc 上 二 2 
[max_ dist_squared， idx] = max(dist_ squared ) ; 
majoraxis = [Prr(idx,:) ”cc(idx,:) ]; 
d = Sqrt(max _dist_ squared)， 
Upper_image_row = S.BoundingBox(2) + 0.5; 
Jeft_image_col = S$,.BoundingBox(1) + 0,.5; 
majoraxis(: ， 1) = majoraxis(:，1) + upper_image_row -1 ; 
ma]joraxls(:，2) = majoraxis(:，2) + left_image_col ~ 1 


function fbasicrect， minoraxis] = Compute_basic_rectanglefSs， 

erim PP 
[BASICRECT,WMINORAXIS] = COMPUTE_BASIC RECTANGLEIS ， 和 
PERIM_C) computes the basic rectanglie and the minor axis 
end-points for the region represented by the Structure 3 S must 
Contain the fields Image， BoundingBox，NWMajorAxis，and 
Diameter， PERIMLR and PERIM_ C are the row and column coordinates 
of perimeter of s,Image，BASICRECT is a 4-by-2 matrix，each Fow 
of Which contains the row and column coordinates of one corner of 
the basic rectangle， 


% Compute the orientation of the major axis， 

theta = atan2(S.MajorAxis(2，1) ~ S.MajorAxis(1，1)， 
S,MajorAxis(2，21) 一 S ,MajorAxis(1，2) ); 

Form rotation matrix， 


和 
= [cos(thetal) Sin(theta); -Sin(theta) cos(thetay) ] 


了 
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Rotate perimeter pixels， 


Ge 


p = [perim c perim_ mr ; 

和 

% Calculate minimum and maximum x- and y-coordinateS for the Protated 
%5 perimeter pixelS ， 

Xx= pl(:，1)); 

yY = pl(:，2); 

min Xx = mintX); 

max X = maxt(X); 

min y = min(y); 

max yY = max(y) ; 


cornerSs xX = [min x max Xx max _Xx min xj] ; 
、 上 昌 上 二 
corners yY = [min y min y max_y max_y] | 
% Rotate corners of the basic rectangj1e. 
corners = [corners X CornerSs_y] * T 


% Translate according to the region's bounding box， 

upper image_row = s,BoundingBox(2) + 0.5j 

left_ image_Ccol = s.BoundingBox(1) + 0.5; 

basicrect = [corners(:，2) + Upper_jmage_row 一 于 守 元 二 8 
corners(:，1) + left_image_col 一 1 


% Compute minor axis end-points，rotated， 


xX= (min x+maxX) /2 
Vy1 = miIn_ y， 
Y2 = max_y; 


endpoints = [X yY1; Xx yY2]; 

5 Rotate minor axis end-pointSs back. 

endpoints = endpointSs * T 

% Translate according to the region's bounding box , 

minoraxis = [endpoints(:，2) + Upper_image_row 一 .和 
endpoints(:，1) + left_ image_col - 1]; 

function fFr，c] = prune_pixel_ 1isttr， C) 

% [ffR，C] = PRUNE_PIXEL_LIST(R， C) fremoveS pixe1S from the Vectors 

BR and C that Cannot be endpoints of the major 3aX1S， This 

elimination is based on geometrical constraints described In 

RUSS，Image ProcesSing Handbook，Chapter 8 


oo Go ae 避 


top = mantr); 

bottom = max(r) 

1eft = min(c) ; 

right = max(Ch) 

% Which points are inside the Upper circle? 

X = (Left + Fight) /2， 

y = Topi 

radius = bottom 一 七 Dp， 
inside Upper = ( (CC 一 X)，2 + (F--y), 2) < radius <; 








证 
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%5 Which points are inside the LowWer Circle? 
y = bottom; 
InsSide Jower = ( (C 一 Xi"2+rtpr 一 yi 2 ) < radius^2; 


% Which polints are inside the Ieft CircLe? 

X = Jeft; 

Y = (top + bottom) /2; 

radius = right -Left; 

InsSlide jeft=( (C- xi.2+ (Pr-y. 2 ) < radius 2; 


点 Which points are inSide the right CiIrclLe? 
X = Pight; 
inside_right = ( (C- Xi).^2+ (一 y)2 ) < Pradius^ 2; 


多 EJiminate points that are inside al1 four CILrcles. 

delete idx = find(inside Jeft & inside_right & .,， 
inside_ Upper & inside Lower ); 

r(delete_idx) = []; 

cl(delete idx) = []; 


F 


function Cc = fchcode(b，conn，dir) 

%FCHC0ODE Computes the Freenman chain Code of a boundary， 

C = FCHCODE(B) computes the 8-connected Freeman chain code of 8 
set of 2-D coordinate pairs Contained in B，an np-by-2 array，(C 
1S a Structure With the foL1oWing TieJlds: 


CC = Freeman chain code {1-by-np) 

c.diff = First difference of Code C.fcc (1-by-np) 

C .mm = Integer of mininmum magnitude from c.fcc (1-by-np) 
c,diffmm = First difference of code c,mm (1-by-np) 

C,XxX0Oy0 = Cooprdinates Where the Code Staprts (1 -by-2) 


C = FCHCODE(B，CONN) produces the Same Outputs as above，but 
with the code connectivity specified In CONN. CONN can be 8 for 
an 8-connected chain code，or CONN can be 4 for a 4-Cconnected 
chain code，Specifying CONN=4 is Valid only if the input 
seduence，B，contains transitions With values 0，2，4，and 6， 
exClLUSIVejJy， 


C = FHCODE{B，CONN，DIR) produces the Same OutputSs as above，but， 
in addition，the desired code direction 1iS Speciflied，Values fo 
DIR Can be: 


Same as the order of the Sequence of points In b， 
This is the default ， 


Same， 


Feverse' Outputs the code lin the direction 0pposite to the 
direction of the points in B， The Starting point 
for each DIR 1IS the Same， 


osoedocoopooedcdocece de de Ja o 吕 
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The elJenments Of B are asSumed to CorreSpond to a 1-pixel-thick， 
ful1y-CcConnected，cJlosed boundary，B cannot Contain duplicate 
coordinate pairs，except in the first and 1Last positions，which 
1S 3a Commnon feature of boundary tracing programs ， 


FREEMAN CHAIN CODE REPRESENTATION 

The tabje on the jleft Shows +the 8-connected Freeman chain codes 
correSponding to allowed deltax，deltay palrs，An 8-chalin 1S 
Converted to a 4-Chajin jif (1) if conn = 4; ang (2) only 
transSitions 0，2，4，and 6 oOccur In the 8-code， Note that 
dividing 0，2，4，and 6 by 2 produce the 4-Code， 


本 


本 


0 0 0 
一 1 1 1 
一 1 0 2 1 
一 | 一 1 纪 
0 一 1 4 2 
1 一 | 5 
] 0 6 3 
1 1 7 


只 汪汪 本 本 让 本 本 本 可 由 加 本 吗 叫 


The formula Z = 4*(deltax + 2) + (deltay + 2) gives the follLowing 
sequence corresponding to rows 1-8 in the preceding tab1e: zZ = 
11,7,6,5,9,13,14,15，These values can be Used as indices into the 
tab1e，improving the Speed of computing the chain code， The 
preceding formula 1S not unlique，but jit 1S based on the smallest 
integers (4 and 2) that are powers of 2. 


dd 


上 5 PPeliminarnlies 


Lif nargin == 
dir = Same ' |; 
conn = 8; 


eJSelif nargin == 2 
di = Same |; 
elSelif nargin == 3 
5 Nothing to do here， 
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#% the same. If this is the case，eliminate the last point 
if IsSequal(b(1，:)，b(np，2:)) 

np = np 一 1 

b = bf(1:np，:)， 
end 


% BuUild the code table Using the single indices from the formula 
% for zZ given above: 

&(11)=0; 0C(7)=1;i 0C(6)=2; C(5)=3; C(9)=4， 

Cl(13)=5; C(14)=6; C(15)=7; 
和 


End of Preliminaries 


% begin procesSsing， 
xo = bf(1，1); 

Y0 = b(1，2) | 
C.xX0y0 = [x0，y0] ; 


#% Make Sure the coordinates are organized S$equentially: 

% Get the deltax and deltay between Successive pojints in b， The 
多 ]ast row of a is the first row of b. 

a = Circshift(b，[-1，0]); 


DEL=a--baisannr-by-2nmatrix in which the rows contain the 
deltax and deltay between Successive points in b， The two 
ComponentS in the kth Fow of matrix DEL are deltax and deltay 
between point (xk，yk) and (XK+1，yk+1)， The Jast row of DEL 
Contalns the deltax and deltay between (Xnr，ynr) and (x1，Yy1) 


o8 of op de 吕 


和 


各 (1.6,，between the last and first points in b) ， 


( 
EL = aa 一 bb; 


二 3 


% If the abs Value of either (or both) components of a pair 
各 (dejtax，deltay) is greater than 1，then by definition the Curve 
% is broken (Or the points are out of order)，and the progranm 
先 terminates， 
if any(abs(DEL(:，1)) > 1) | any(abs(DEL(:，2)) > 1 

有 input curve is broken or points are out of order，) 
en 


% Create 3a Single index vector USing the formula described above， 
z=4"(DEL(:，1f) +2)+(DEL(:，2) +2); 


Use the index to map into the table，The following are 
the Freeman 8-chain codes ， organized in a 1-by-np array. 
Tcc = C(z) ; 


e1Se Re 
error('Incorrect number of inputs.') 4 Check if direction of code sequence needs to be Feversed ， 
end If Strcmp(dir，'reverse ') 
[np，nc] = size(b); fcc = Coderev(fcc);i % See below for function coderev， 
if np < nc 到 end 
error('B must be of size np-by-2.')， 4 各 TI 4-connectivity is specified，check that al1 componentS 
end fo0f fcc are 0，2，4，or 6， 
% Some boundary tracing programs，Ssuch as boundaries.m，output a 计 f 加 
s sequence in which the coordinates of the first and last points are va = 1indtfcc ==1 | fcc==31|1fcoc==5|fcc==7); 


if isempty(val) 


/2 Appendix( 癸 M-Funtions 


fcc = fcc./12; 
el1Se 

Warning( The specified 4-connected code cannot be Satlisfied，) 
end 


end 
5 Freeman chain code for Structure output， 
c.fcc = fcc; 


% 0Obtain the first difference of fcc， 
.diff = Codediff(fcc,conn); 5 See below for function codediff. 


C 
s 0Obtalin code of the integer of minimum magnitude， 
cmm = minmag(fcc); 5 See below for function minmag 


s% 0Obtain the first difference of fcc 
c.diffmm = codediff(c.mm，conn) ; 


function cr = Coderev(Tfcc) 

5 Traverses the sequence of 8-connected Freeman chain code fcc in 
% the opposite direction，changing the values of each code 

5 Segment，The starting point jis not changed. fcc is a 1-by-np 

生 array. 


Flip the array left to right. This redefines the starting point 
as the last point and reverses the order of 上 travel through the 
code， 

cr = fJLipJr(fccy)， 


Next，obtain the new Code values by traversing the code in the 
0pposlte direction， (0 becomes 4，1 becomes 5，,,。，5 becomes 人 
6 becomes 2，and 7 becomes 3) ， 

ind1 = find(0 <= cr & cr <= 3)， 

ind2 = find(4 <= CF & cr <= 7)， 

cr(ind1) = cr(indf) + 4; 

crlind2) = crflind2) - 4; 


function z = minmag(c) 

5MINMAG Finds the integer of minimum magnitude in a chain code. 

5 和 = MINMAG(C) finds the integer of minimum magnitude in a given 
% 4- OF 8-connected Freeman chain code，C. The code is assumed to 
%5 be al1l-by-np array， 


% The integer of minimum magnitude starts With min(c) ， but there 
5 may be more than one Such valiue， Find them al1， 
I 


= find(c == min(c) ); 


and shift each one left So that it starts With min(c)， 让 


J = 0; 和 
A = zeros(length(I)，1Llength(c))， 于 
于 OF 二 隐 汪 开 1 
dd=d+T 
AtJ，:) = Circshift(c,[0 -(K-1) ])， 
end 
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% Matrix A contalins al1 the possible candidates for the jinteger of 
% finimum magnitude，Starting With the 2nd colLlumn，Succesively find 
% the minima in each CoLumn of A，The number of candidates decreases 
% as the Seach moves to the right on A， This is reflected in the 
% elements of J， When lengthfd)=1，one candidate remains. This is 
% the integer of minimum magnitude , 
[M，N] = Size(A) ; 
J= (13:M ; 
for kK = 2:N 
D(1:M，1) = Inf; 
D(d，1) = Ad，k) 
amin = min(Atd，kKk)); 
J=Tfind(D(:，1) == anmin); 
if length(d)==1 
Z=AJ， :); 
return 


有 


function d = Codediff(fcc，conn) 

%C0ODEDIFF Computes the first difference of a chain code. 

D = CQODEDIFF(FCC) computes the first difference of code，FCC. The 
code FCC is treated as a Circular sequence，S0 the 1ast elLement 
of D is the difference between the last and first elLementSs of 
FC4C， The input code 1S 8a 1-by-np vector， 


The first difference is found by counting the number of direction 
changes (In a Counter-CcLockwise direction) that Separate twWo 
adjacent elements of the code. 

S 六 = Circshift(fcc，[0，-4])， 5% Shift input left by 1 location， 

delta = Sr 一 于 cc 


ao 虽 oooaco oo de do 


d = delta; 
I=findtdelta < 0); 
type = Conn 


SWiLtch +type 

Case 4 5 Code 1S 4-Connected 
dtI) = d(I) + 4; 

case 8 5%5 Code 1S 8-Connected 
d(I) = d(I) + 8 

end 


6 


function g = gscale(f，Vvarargin) 

%GSCALE Scales the intensity of the zinput image . 

%5 6G= GSCALE(IF，' ful18 ) Scales the intenSsities of F to 上 the fu]1 
扣 8-blit intenSslity range [0，255]. This 1S the default 1if there 1S 


7 


7Y 
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only one input argument 


G = GSCALE(F， 'ful116 ) Scales the intensities of F to the fu11 
16-bit intenSsity range [0，65535] ， 


6 = GSCALE(F， minmax' ，LOW，HIGH) Scales the intensities of F to 
the range [LOW，HIGH] These values must be provided，and they 
must be in the range [0，1] ，independent1ly of the class of the 
input. GSCALE performs any neceSsSsary SCcaling. If the input is of 
Class double，and its values are not in the range [0，1]，then 
GSCALE Scales it to this range before processing， 


coopootoeooew ooede do db do 


The class of the Output is the Same as the Class of the input. 


if Jength(varargin) ==05% If only one argument it must be 了 ， 
method =“fu1l18 ;| 

el1Se 
method = varargin{ft1}; 

end 


if Strcmp(class(f)，'double') & (max(ff(:)) > 1 | min(f(:)) < 0) 
f = mat2gray(f) ; 
end 


各 Perform the specified Scaling. 
SWitch method 
Case “fuU118: 
g = im2uint8f(mat2gray(double(f)) ); 
CaSe ful116: 
g = imn2uint16(mat2gray(double(f) ) ); 
CasSe “ljInmaxX' 
Low = varargin{f2};，high = varargin{f3} |; 
if jow>11|1ow<01|1high>1 |high<0 
error( Parameters Low and high must be in the range [0，1].') 
end 
计 Strcmp(class(f)，'double') 
LIow in = minf(f(:)); 
high in = max(f(:)); 
6eJ]Seif strcmp(class(f)，'uUint8 ') 
low_ in = double(min(f(:)))./1255， 
high_jin = double(max{(f(:)))./255; 
elseif strcmp(Class(f)，'Uint16') 
Low_in = double(min(f(:)))./65535; 
high_in = double(max(f(:))),./65535; 
end 
s% imadjust automaticalLy matches the Class of the input， 
g = imadjust(f，f[Low in high_in]，[Low high] )， 
otherwise 
error( unknown method .  ) 
end 


本 
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function [X，R] = imstack2vectors(S， MASK) 
%ITWMSTACK2VECTORS ExtPacts vectors from an image Stack 

[x，R] = imstack2vectors(S，MASK) extracts vectors from S，which 
1s an M-by-N-by-n stack array of n registered jimages of Size 
Mby-N each (See Fig， 11.24)，The extracted vectors are arranged 
as the rows of array X， Input MASK is an M-by-N 1 0g1cal or 
numeric image With nonzero values (1Ss 计 it isa 10gical array) 
In the 10ocations where elements of S are to be used in forming X 
and 0S in locations to be ignored. The number of row vectors in xy 
is 6qual to the number of nonzero elements of MASK， If MASK 1S 
omitted，al1 M*N 1ocations are Used in forming X。 A Simple way to 
obtain MASK interactively is to Use function roipoly，FinallLy，R 
is an array Whose rows are the 2-D coordinates containing the 
region 10cations in MASK from Which the vectors in 9 were 
extracted to form X， 


名 oopRoposdodsoddode jos 


乞 PrelLinmiznaries， 
[M，N，n] = Size(S); 


if nargin == 

MASK = true(M，N) ; 
elSse 

MASK = MASK ~= 0 
end 


抑 Find the Set of 10ocations where the vectors WiJli be kept before 
% MASK is changed later in the progranm， 
[I，J]j = find(MASK) ; 


慰 二 本 志 有 

抒 Now find X， 

5 First reshape S into X by turning each set of n values along the third 
dimenSion of S So that it becomes a row of X， The order is fronm top to 
%5 bottom al10ng the first Column，the second Column，and So 0n， 

Q = MxN; 

X = reshape(S，Q，n) ; 

和 5 NOW reshape MASK so that it corresponds to the Fight 1ocations 


5 Vertically along the elements of X. 
MASK = reshape(MASK，Q，1)， 


% keep the rows of X at Locations where MASK is not 0. 
X = X(MASK，:); 


function [x，y] = intline(x{f，Xx2， y1，Yy2) 

%INTLINE Integer-coordinate line drawing algorithnm， 

[X，Y] = INTLINE(X1，X2，Y1，Y2) computes an 
approximation to the LIine Segment joining (X1，Y1) and 
(X2，Y2) with integer coordinates. X1，X2，Y1T，and Y2 
Shoujid be integerSs， INTLINE is reverSib1le; that is， 
INTLINE(X1，X2，Y1，Y2) produces the same results as 
FLIPUD(INTLINE(X2，X1，Y2，Y1) )， 


oo oo de oo do de 
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% Copyright 1993-2002 The MathWorks，Inc，Used with permisSion， 
% ， $Revision: 5.11 $ $Date: 2002/103115 15;57:47 $ 


dx = abS{(Xx2 一 X1) 
dy = abs(y2 一 y1) 
%5 Check for degenerate CasSe， 
计 ((dx == 0) 8& (dy == 0) 
X = XT; 
站 
return; 
end 
flip = 0 
if (dx >= dy) 
if {(X1 > X2) 
%5 AlLlways "draw" from 1Left to right， 
七 = Xi XxX1 = X2j X2Z = 七; 
t=yiyl=y2iyc = 
flip = 1; 
end 
mm= (yY2--yY+)/(x2 一 X1); 


汪 
中 外 省 


( 关 二 六 2 下 全 
yY = round(y1 + mx(X 一 XT)) 
elSe 
计 (Y1 > y2) 
% AlwaySs “draw” from bottom to top， 
t= xi; xl = X2j X2 = 鞋 ) 
t=yiyl=y2iyc = 二 
flip = 1 |; 
end 
站 = (Xx2 一 Xx1)/(Y2 一 y1) 
yY = (Y1:Y2) ，; 
X = round(x1 + mxf(y 一 Y1))) 
end 
if (flap) 
Xx = 隔 lIpud(X) ; 
y = flipud(y) 
end 


function phi = invmomentSs(F) 

%TNVMOMENTS Compute invariant momentSs of image。 

% PHI = INVMOMENTS(F) computes the moment invariants of the inmage 
% FF. PHI is a Seven-element row Vector containing the moment 


% jinvariants as defined in equations (11,3-17) through (11,.3-29) of 


5 Gonzalez and Woods，Digital Image Processing，2nd Ed 


% Fnmust be a2-D，real，nonsparse，numeric on 10gical matrix， 


if (ndims(F) ~= 2) | issparse(F) | -isreal(F) | -(isnumerlctF) 
1iS1ogical(F ) ) 


error{(['F must be a 2-D，real，nonsparse，numeric or 10gical “ ，,， 


matPzx。 ]); 
end 
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F = doublefF); 
bhi = compute_phif(compute_etal(compute _m(F)) ); 


洛 --------------------------- 气 
function 用 = Compute_m(F) 

[M，N]j = Silze(F) ; 

[x，yY]j = meshgridf(T:N，13:M) ， 


% Turn X，yY，and F into Column Vectors to make t+the Summations 3 bit 
%. eaSler to Compute ln the fol1owing， 

X = X(:)， 
y = y(I) 
F = 


和 DIP equation (11.3-12) 

和 .moo = Sum(F ) ; 

% Protect against divide-by-zero warnings， 
if (m.moo == 0) 


站 ,PoO0 = epSs; 
end 
点 The other Central moments : 
m.miDO = SUm(X ,*x F); 
mmol = Sum(yY .* F); 
mm,mlil1 = SUm(X ,* Y .* 下 )) 
mn.m20 = SUm(X.^2 .* 上 
m,mo2 = Sum(y,^2 ,* F); 
mm30 = SUm(X, 3 .* 上 ); 
nm.mo3 = Sumly., 3 .* 了 ) 
mi,mt2 = SUmX .* Y,^2 ,* 上 ); 
m,mn21 = SUT(X, 2 ,*yY .x 下 ); 


function 8e = Compute_etatf(m) 
点 DIP equations (11.3-14) through (11.3-16)， 


xbanr = mi ,ml0 /Am,mo0; 


ybar = 自 ,mo1 / m.mo0; 
e.etal1 = (m,mil - ybar*m,m10) /nm.nmoo^2 
e.eta20 = (m,m20 一 Xbarxm.m10) 六 前,mO00^2 ; 
e.eta02 = (m,mo2 一 ybarx*m.nmnoj) ”mnmoo “2 ; 
e.eta30 = (m,m30 一 3* xbar xm.m20+2，* xbarc2* mmioh nm,mnoo 2.5; 
e.eta03 = (m,mo3 一 3* ybar* mmo2+2x* ybar*2 * mmol) /mno0^2.5j 
e.eta21 = {m,m21 一 2* Xbar* mmil 一 ybar * mim20 + .，,. 
2*Xxbarc2* 站 .mol) mmoo>2.5; 
e,etal2 = {m,ml2 一 2x ybar* mn 一 Xpbar*fmo2 + ，,， 


2x* ybar^2 * 有 ,mt0) /m,mo0^2.5; 


function phi = Compute_phit(e) 
% DIP equations (11.3-17) through (11 .3-23)， 


人 
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P 


function B = pixeldup(A， 站 ，m) 

%PIXELDUP Dup1icates pixels of an image in both directions， 

%  B=PIXELDUP(A，M，N) duplicates 8ach pixel of AMW+times in +the 
vertical direction and N timeSs in the horizontal direction， 
Parameters M and N must be integers， If N is not included， 让 tt 


defaulLts to 从， 


%5 Check inputs ， 
if narglin < 2 
error( At 1Least two inputs are redqulred，) ; 
end 
if nargin == 2 
nm = 人 前; 
end 
5 Generate aa Vector With elLements 1:Size(A，1)， 
U = 1:Size(A，1) 
DupJlicate each element of the Vector m times . 
round(m);，% Protect against noniIntergers ， 
U(Ones(1T， 几 ) ， :) 
U 人 (3 ) 
点 Now repeat for the other direction， 
= 1:Size(A，2); 
= _ round(n) ; 
= V(Oones(1，n)， 1:) 


G@ ce de 


V 

咱 

V 
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B = Au ，V); 

function angles = polLlyangles(Xx，y) 

%POLYANGLES Computes internal polYgon angles . 

%5 ANGLES = POLYANGLES(X，Y) computes the Interior angl1es (in 

s degrees)j of an arbitrary po0lLygon Whose vertices are given in 

%55 [xx，Y，ordered in a ClLockwise manner， The program elLiminates 
%5 duplicate adjacent rows in [X Y]，except that the first Fow may 
%% equal the jast，SO that the polygon 1S C10Sed， 


% Preliminaries ， 
[x yj = dupgone(X，y); 上 55 ELiminate duplLicate Vertices， 
xyY = [xyY()]; 
if 1ISempty(Xy ) 
%5 NO Verticesl 
angles = zer0sS(0，1) ; 
return 
end 
if Size(xy，1) == 1 | ~isequal(xy(1，:)，Xxyfend，:)) 
光 CC10Se the polygon 
xy(end + 1， 二 XY(1， 2) 
end 


s5 Precompute Some quUantities ， 
d= diff(xy，1) 


3 


3 


下 
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v1T = -d(1:end，:); 

ve = [d(2:end，:)， d(1，:) 
v1_ dot v2 = SuUm(v1 .* V2，2); 
mag_v1T = Sdqrt(Sum(v1T .2，2) ); 
mag v2 = Sqrt(Sum(v2.^2，2) ); 


多 Protect agajinst nearly duplicate vertices; output angle will be 90 
% degrees for Such cases，The "real"” further protects against 

% possible Small imaginary angle Components in those Cases， 
mag_V1T(~mag Vv1T) = epSs; 

mag_v2(~mag v2) = eps; 

angles = real(acos(v1_dot v2 ,/ mag_yv1 ./ mag yv2) * 180 / pi)， 


% fhe first ang1le computed was for the Second vertex，and the 
多 Jast Was for the first vertex，Scroll one position down to 
% make the last vertex be the first， 
angles = Circshift(angleSs，[1，0] )， 


5% NOw determine if any vertices are concave and adjust the angles 
% accordingjly， 
sgn = Convex_ang1le_ test(Xy) ; 


s% Any element of Sgn that 'Ss -1 indicates that the ang1le is 
5 Concave，The corresponding ang1les have to be Subtracted 
from 360， 

I=Tfind(sSgn == 一 |); 

angles(I) = 360 - angles(I); 


function Sgn = Convex_ang1e test(Xy) 

The rows of array xy are ordered vertices of a polygon， If the 
kth angle is convex (>0 and <= 180 degress) then sgn(k) = 

1， Otherwise Sgn(k) = -1. This function assumes that the first 
vertex in the 1ist is Convex，and that no other vertex has a 
smaller value of x-coordinate， These two conditions are true in 
the first vertex generated by the MPP algorithm，Also the 
vertlces are asSSumed to be ordered in a clockwise sequence，and 
there can be no duplicate vertices， 


The test is based on the fact that every convex vertex is on the 
poSsitive Side of the line passing through the two vertices 
immediately fol110wing each vertex being considered， If a vertex 
Is concave then it Lies on the negative Side of the line joining 
the next two yertices. This property is true also 计 positive and 
negative are interchanged in the preceding two sentences ， 


oopmopeoepaoooedoeoede yb up 


%% It js asSSumed that the polygon is closed， If not，close 让 tt， 
1f size(Xy，1) == 1 | -isequal(xy(1，:)，xy(end，:)) 

Xy(end + 1，:) = XyY(1，:) 1 
end 


5 Sign Conyention: Sgn = 1 for Convyex vertices (1.,8e，interior angle > 0 
% and <= 180 degrees)，sSgn = -1 for concave Vertices. 
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和 EXtFeme polints to be Used in the followling 100p. AT1Tis appended 
% to perform the inner {dot) product with w，which is 1-by-3 (see 
%5 below) . 

L = 10 25; 

top Jeft = [~L，-L， 寺 ] ; 

top_right = [-L，1L，1]; 

bottom left = [L，-L，1T]; 

bottom_right = [L，L，T] 


Sgn=1; 各 JTJhe first vertex 1IS known to be Convex， 


% Start fo0LJowing the vertices ， 
for K = 2:Lengthf(xy) 一 1 
pfirSst= Xy(k 一 1，:); 
psecond = xy{(k，?:); 5 This is the point tested fOF COnVveXIity， 
pthird = Xy{k + 1，:) 
Get the Coefficients of the line (polygon edge) passing 
through pfirst and psecond， 
= p0lLyedge(pfirst，psecond ) ; 


Establish the positive Side of the line WwW1Xx + W2y + W3 = 0， 

The positive Side of the Line Should be in +the right Slide of the 
Vector (psecond - pfirst)， deltax and deltay of this Vector 
give the direction of travel. This establishes Which of the 
extrenme pointS (See above)j Should be on the + Slide， 于 that 


ee 吕 


oO aoceoaoeo 喜 


CO 


Gejltax = psSecond(:，1) 一 pfirst(:，1) 

deltay = psecond(:，2) 一 pflirst(:，2)j 

if dejtax == 0 & deltay == 
error( ' Data into Convexity test 1S 0 or dup1icated，) 

eng 

if deltax <=0 & deltay >= 055 Bottom_right Should be on + Side. 
Vector product = dot(w，bottonm_right);， 先 Inner product， 
WwW = Sign(yector_product)xW; 

86lSejlf deltax <= 0 & deltay <= 0 % Top_right Should be on + Side， 
Vector _product = dot(w，top_right ) ; 
W= Sign(vector_ product )*W; 

elSelif deltax >= 0& deltay <= 0 55Top Left Should be on + Side， 
Vector product = dot(w，top_1eft) 
WwW= Signf(vector_product)xWj 

el1Se % deltax >= 0 & deltay >= 0，S0O bottom_left Should be on + Side. 
Vector product = dot(Ww，bottom_1eft) ， 
W= Signt(vector_ product)*W; 

end 

% For the vertex at psecond to be convex， pthird has to be on the 

%5 poSslitive Side of the 1ine， 


sgntkK) = 1 

if (WwW(1)*pthirdf:，1) + WwW(2)*pthird(:，2) + wW(3)) < 0 
sgn(k) = 一 1 ; 

end 


end 


point 1S on the negative Side of the 1Line，then WwW 1sS replaced by 一 W， 
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function WwW = polyedge(p1，p2) 
% 0Outputs the coefficientS of the ine pasSing through pt1 and 
先  p2， The line 1S of the form WfX + W2Y + W3 = 0， 


pl 1 YI=Pp1(:， <) 
; = p2(:，2) 


三 
寻 川 计 
| 
一 
2 
一 人 


W3 = 二 
el1Seif yY1==y2 
W1 


全 
[ 
IN Hi 弓 


0 

WwW3 = ] 
elSeif xX1 == yY1 & X2 == Y2 

w{ = 1 

W2 = 1 

0 


扩 上 具 


(y1 - y2)1(xt*(Y2 - yY1) 一 Y1*(Xx2 一 X1) + epsh 
~W1x(x2 一 X)7(Y2 一 Y1) 
1 ; 


局 
fa 
4 


function [xg，yYg] = dupgone(X，yI) 
% ElLiminates duplicate，adjacent rows in [X yY]，except that the 
5% first and 1ast rows can be equal So that the polygon :SS C10Sed， 


Xg 三 Xi 
= y， 
if Size(xXg，1) > 2 
I = Tind((x(1:end-1，:) == 
(y(1:end-1，:) == y(2:end，:) 
1] ; 
[ 


X(2:8end，:)) & ... 
) ) ; 


闪 

2 
na 
4 一 
一 一 

[| 


end 


R 


function TIxn，yn] = randvertex(X，y，npix) 

%RANDVERTEX Adds random noise to the Vertices 0f a po01lygon， 

[XN，YN] = BANDVERTEX[X，Y，NPIX] adds Uniformly distributed 
noise to the coordinates of vertices of a polygon， The 
coordinates of the vertices are input in X and Y，and NPIX 1S the 
maxinmum number of pixel 10cations by which any pair (X(I) YL ) 
is al1lowed to deviate，For examp1e，jif NPIX = 1，the loOcation of 
any X(i) will not deviate by more than one pixel location In the 
x-direction，and sjimilarly for Y(1)，Noise is added independently 
to the two coordinates ， 


oo co oo obe de co oo oo 
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phit1) = eeta20 + e.eta02; 
phi(2) = (8e.eta20 -8e.eta02)^2 + 4*e.eta11 2 
phi(3) = {e.eta30 -- 3x*e.eta12)^2 + (3*e,eta21 一 6.eta03) “2 | 
phi(4) = {e.eta30 + eetal2)^2 + {(e.eta21 + 8.eta03) “2 
phi(5) = (e.eta30 - 3x*e,etal2) * (e.eta30 + 6.etal2) * ,，,， 
( {e.eta30 + e.eta12)^2 - 3*(e.eta2l + 6.eta03) “2 ) 十 
(3*e.eta21 - ee.eta03) * (e,eta21 + 8e,eta03) * ..， 
( 3x*(e.eta30 + e.etal2)^2 - (eeta21 + 8,.eta03) “2 ) 
phif(6) = (e.eta20 ~ eeta02) * ( (8e,eta30 + 6.etal2)j^2 一 ，.， 
(e.eta21 + 8e,eta03)^2 ) + ，,， 
4*e.etall * (eeta30 + e.etal2)j) * (8e.eta2l1 + 8.eta03) 
phi(7) = (3*e.eta21 一 6e,eta03) * {(e.eta30 + eeta12) * ，,， 
( {e.eta30 + eetal12) “2 ~ 3*(e,eta21 + 6.eta03) “2 ) 二 ,，,， 
(3*e.etat2 -ee,eta30) * (e.eta21 + 6,.eta03) * ,，,， 
( 3*(e.eta30 + 6e.etalt2)^2 - (eeta21 + e.eta03)^2 ) 
内 


function [X，yj = minperpoly(B，Ccel1size) 

%MINPERPOLY Computes the minimnum Perimeter p01Lygon. 

[X，Y] = MINPERPOLY{(F，CELLSIZE) computes the veprtices in [X，Y] 
of the minimnum perimeter polygon of a Single binary region Or 
boundary in image B，The procedure is based on Slansky S 
Shrinking rubber band approach，Parameter CELLSIZE determines the 
Silze 0f the Square cel1s that enclose the boundary of the region 
In 8B. CELLSIZE must be 3a nonzero Integer greater than 1 ， 


Tihe algorithm 1S applicable only to boundaries that are not 
Self-intersecting and that do not have one-pixel-thick 
protrusions ， 


co opoocoeceacede oo oo oo 


if Cel1lsSize <= 1 
error( CELLSIZE must be an integer > 1，) | 
end 


FiJlL B in case the input was provided as a boundary. Latenr 
the boundary will be extracted With 4-cConnectivity，which 
is _ required by the algorithm。，The Use of bwperim asSures 
that 4-Connectivity is preserved at this point， 
imfilLlL(B， "holes ) ; 
bwperim(8) ; 
M，N] = Sizef(B) ; 

Increase lmage Slze SO that the image 1S 0f S1ze K-by-K 
With (a) K >= maxfM;,N) and (bj) KK/cellsize = a power 0f 2. 
nextpow2(max(M，N)/7celisize) 

(2^K)*Cel1Size; 
Increase lmage Size to nearest integer power of 2，by 
appending zeros to the end of the image. Thls Widl1 al LowW 
quadtree decompositions as Small as Cells 0f Size 2-by-2， 
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Which is the Smallest alLLowed value of cellsize， 


K 一 从; 
K 一 Ni 


padarray(B，[MN]，'"post'); 点 fis now of size K-by -kK 


ob 办 之 三 de 
1 1 HH 


Quadtree decomposition， 
= gtdecomp(B，0，cellsize)， 


5 Get all the Subimages Of Size cellsize-by-cellsize， 
[vals，r，C] = qtgetblk(B，Q，gcellsize) ; 


% Get all the Subimages that contain at least one black 
Pixel，These are the cel1ls of the wall enclosing the boundary ， 
find(Sum(Sum(vals(:，:，:)) >= 1))， 

PIT) ; 

CT) ; 


[X ，yY] is a length(I)-by-2 array， Each member of this array 1S 
the left，top corner of a black cell of size cellsize-by-Ccel1lsize. 
Fiil the cells With black to form a cl1osed border of black cel1ls 
around interior points. These cells are the cellular Comp1Lex ， 
for k = 1:1engthfI) 

B(X(Ki:X(K) + celilsize-{1，y(k):yY(K) + Cell1Ssize-f) = 1]; 
end 


BF = imfill(B， "holes' ); 


“ 必 X 上 co 


村 用 


cc op 呢 go 


各 EXtract the points interior to the black border， This is the region 
so0f interest around which the MPP will be found， 

B=BFR&(-B); 

务 Extract the 4-cConnected boundary， 

8B = boundaries(B，4，'cwW' ) ， 

多 Find the largest one in case of parasitic regions . 

J = cellfun( 1ength' ，B) ; 

IT=Tfind(J == maxfd)); 

B = B{I(1T)) 

% Function boundaries outputs the last coordinate pair equal to the 
点 firSst. Delete 让 t， 

B = B(1:end-1,，:); 

5 0btain the Xy coordinates of the boundary . 

X=B:，1)) 

六 二 2) 


Find the Smallest x-coordinate and corresponding 
smallest Y-Coordinate ， 

CX = find(x == min(X) ); 

cyY = find(y == minf(y(Ccx)))， 


oR oo 


The Cell With top 1eftmost Corner at (X1， y1) below is the first 
point Considered by the algorithm.， The remaining points are 

%5 ViSited in the clockwise direction Starting at (xf，Yy1) ， 

X1 = X(CXx(1)); 

Yy1 = yY(CcY(1) ) 


用 生 
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Scroll data so that the first point is (xl1，y1)， 
find(x == X1 &yY== Yy1) 

Circshift(Xx，[-(I 一 1)，0]) 

circshift{y，[-(I 一 1)，0])) 


5%5 The Sanme Shift applLies tO B， 
B = circshift(B，[{-(I -1)，0])) 


The first row of B is the required 


| Go 
革 用 


日 


s%5 Get the Freeman chain code ， 
% Starting point，、The first element of the Code 1LS the transition 

% between the 1st and 2nd element of B，the second element 0f 

% the code is the transition between the 2nd and 3rd elementS of B， 
% and so on， The last element of the code is the transition betWween 
s% the last and 1st elements of B，The elLlements of B form 3a CW 

% Sequence (See above)，SO wWe Use 'Same ”for the direction in 

% function fchcode ， 
fchcode(8，4， 
code ,fcc ; 


code -Same ) ; 


code 


% FolLlow the code sequence to extract the Bl1ack DotSs，BD，(COnVex 

% Corners) and White Dots，WD，(concave corners) ，The transitions are 
% as follows: 0-to-1=WD; 0-to-3=BD; 1-to-0=BD; 1-to-2=WD; 2-to-1=8D， 
% 2-to-3=WD; 3-t0-0=WD; 3-to-2=dot. The formula t = Te 一 Second 
% gives the following unique values for these transjitions: 一 | ， 一 3，2， 
鬼 
区 
移 


0,，3，1，6，4. These are applicable to travel in the CW direction， 
The WD's are displaced one-half a diagonal from the BD S to form 
the half-celLl expansion required in the a1g0rithm， 


% Vertices Will be computed as array "vertices"” of dimension nV-by-3， 
% where nv is the number of vertices. The first twWo 6lementSs 0f any 
% row of array vertices are the {(X,y) coordinates of the Vertex 

% corresponding to that row，and the third element is 1 if he 

% Vertex is Convex ({(BD) or 2 计 让 is concave (WD).， The first Vertex 
% is known to be convex，SO it 1S black， 

vertices = [X1，y1t，1] 


n =1; 
K = 1 
for 本 2:Length(code ) 


if code(k -- 1) -= Codet(k) 
n=nr+ 于; 


t = 2xcodetkc1) - code(k); 名 5 = Value of formula， 
ift==-3lt==2|t==3|t==45%Convex: Black DotsS， 
vertices(n，1:3) = [x(k)，y(K)，1]) 
elseif t== -1 t==0|t==11|t==65Concave: White bots， 
if 二 == 一 
verticesfn，1:3) = [X(k) - cellSize，y(K) 一 Cel1Size,2]; 
8JLSelf tt== 
vertices(n，1:3) = [Xx(k) + cellSize，y(Kk) 一 CeIdl1S1Z65 2 
elSelf t+t==|1 
vertices(n，1:3) = [X(k) + cel1lSize，yY(K) + CelL1Sizey2] ; 
61Se 
vertices(n，1:3) = [X(K) - cellSize，yY(K) + cel1Size21]; 


end 








直 
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el1Se 
% Nothing to do here， 
end 
end 

end 
%5 The rest of minperpolLy.m procesSeSs the vertices to 
% arrlve at the MPP， 
flag = | 
While flag 


enga 


名 Determine Which vertices Lie on or inside the 
%5 Polygon whose vertices are the Black Dots，Delete all 
所 Other points ， 
= find(vertices(:，3) == 1); 
XV = .Verticesf{I，1); 多 人 of the BlLack Dots ， 
yV = Vertices(I，2); 
X = Vertices(:，1); 5% Coordinates of all vertices . 
Y = vertices(:，2) |; 
IN = inpolLygon{(X，Y，XV，YyYV) 
I = find(IN ~= 0); 
Vertices = verticesS(I， !: ) 


s% Now check for any Black Dots that may have been turned into 
各 Concave Vertices after the previous deletion step，Delete 
多 any Such BJLack Dots and recompute the polygon as in the 

% previous Section of code， When no more changes 0ccur ，Set 

% flag to 0，which causSes the 100p to terminate . 

X = Verticesf{:，+); 

yY = Vertlices(:，2); 

ang16es = polyangles(X，y); % Find all the interior ang1les， 

TI = 了 find(angles > 180 & vertices(:，3) == 1); 

If 1Sempty(TI) 


flIag = 0; 
elSe 
J = 1:1ength(vertices ) ; 
for KkK = 1:1ength(I) 
K = find(J ~= I(K)); 
JJ = JUJ(K) ; 
end 
Vertices = Vertices(d，:) | 
end 


% Final pass to dejlete the vertices With angles of 180 degrees. 


X = Vertlces(:，1); 

y = Vertices(:，2); 

ang1leS = polyangleS(X，yY) 
= find(angles ~= 180) ; 


。 Vertices of the MPP: 


人 


Vertices{(I，1) ) 
vertices(I，2) |; 
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Convert to Columns . 
= X( 1) 

= yY() 

5 Prelinminary calculations. 

L = Jength(X) ; 

xnolise = rand{(L，1+) 

ynoise = rand(L，1); 

xdev = npix*xnoise.xsign(xnoise 一 忆 
ydev = npix*ynoise,*sign(ynoise -- 日， 


和 
X 


右 Add noise and round , 
xn = round(x + xdev)， 
yn = round(y + ydev) |; 


5 Al1 pixel 1ocations must be no Less than 1. 
xn = fax(xn，1)， 
yn = max(yn，1T); 


》 


function [st，angle，x0，y0] = Signature(b，varargin) 

5%5SIGNATURE Computes the signature of a boundary， 

[3ST，ANGLE，X0，Y0] = SIGNATURE(B) Computes the 

Signature of a given boundary，B，where B js an np-by-2 array 
(np > 2) containing the (X，y) coordinates of the boundary 
ordered in a clockwise or counterclockwise direction The 
amplitude of the Signature as 8a function of Increasing ANGLE is 
output in ST，(X0,Y0) are the coordinates of the centroid of the 
boundary，The maximum size of arrays ST and ANGLE is 360-by-1， 
lndicating a maximum resolution of one degree，The input must be 
a one-pixel-thick boundary obtained，for examplIe，by Using the 


[ST，ANGLE，X0，Y01] = SIGNATURE(B) computes the signature，using 
the centroid as the origin of the Signature vector. 


[ST，ANGLE，X0，Y0] = SIGNATURE(B，X0，Y0) computes the boundary 
USing the Specified (X0，Y0) as the origin of the Signature 
Vector ， 


oOR oem oedcede do op wo co oo oo op oo da de Go oo 


s% Check dimensions of b.， 
[np，nc]l = Sizefb); 
If (np<nc | nc-~=2) 
error( B must be of Size np-by-2.')， 
end 


oOe 


Some boundary tracing progranms，Such as boundaries .mn，end Where 
they started，resulting in a sequence in which the coordinates 
of the first and last points are the same， If this 1S the Case， 
In b，eliminate the last point， 

fsegual(B(T (RD 志和 

b = bl1:np -1，:)， 


oo oo 吕 


oo 


function boundaries，By definition，a boundary is a closed curve. 
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np=np 一 1; 

end 

% Compute parameters ， 

if nargin == 
x0 = round(sum(b(:，1))Anp);，% Coordinates of the centroid. 
Y0 = round(Sum(b(:，2))/np); 

elSeif nargin == 


xo = Varargin{1ij 

Y0 = Vararginf21}; 
else 

error( Incorrect numben of InputSs，) 
end 


% Shift origin of coord System to {(xX0，Yy0) )， 
(1)=b(:，1) -x0; 

b(: ， 2 二 0 

s% Convert the coordjnates to polar， But first have to Convert the 

% given image Coordinates，{(X，y)，to the coordinate System Used by 

% MATLAB for conversion between Cartesian and polar cordinates， 

s# DeSlgnate these coordinates by {(XC，yCcC). The two coordinate Systems 

名 are related as fo0llowS: Xc = yand yc = 一 X， 

xcC = hb(:，2); 

yc = -b(:，1); 

[theta，rho]l = cart2pol(xXC，Yc) 


% Convert angles to degrees， 
theta = theta.,*(180/piy); 


它 


% Convert to all1 nonnegative ang1es. 
] = theta == 0; % Store the indices of theta = 0 for use below， 
theta = theta.*(0,5*abs(1 + Sign(theta) )),.， 
一 0.5*(-1 + Signf(theta)).,x*(360 + theta) ; 
theta(j) = 0; 各 To preserve the 0 values， 


cy 
和 八 3 
及 


temp = thetai 

% Order temp So that Sequence Starts With the smallest angle， 
% This Will be Used below in a check for monotonicity， 

I= find(temp == minttemp) ) ; 


croll up SO that Sequence Starts with the smallest angle. 
Se (1) in case the min is not Unique (in this case the 
equence Will not be monotonic anyway ) . 

mp = Circshift(temp，[-(I(1) ~- 1 人) ，0]); 


名 Check for monotonicity，and issue a Warning if sequence 
委 1s not monotonic， First determine if Sequence is 
% CW OF CCW， 
kf = abs(temp{(1) - temp(2)); 
k2 = abs(temp(1) - tempf(3)); 
if k2 > Ki 
SensSe = 1; 移 CCW 
elSelif KkK2 < ki 
Sense = 一 {; 各 CW 
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点 昌 
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6LSe 
warning([ The first 3 points in B do not form a monotonic 
-Sequence，] ) ; 
end 
5%5 Check the rest of the sequence for monotonicity，Because 
s% the angjles are rounded to the nearest integer later in the 
s% program，only differences greater than 0.5 degrees are 
务 ConSidered in the test for monotonicity in the rest of 
务 the Sequence . 
fljag = 0; 
for kK = 3:1ength(temp) = 1 
diff = SenSex(temp(k + 1) 一 temp(k) ); 
if diff < 一 .5 
flag = 1 
engd 
end 
if flLag 
warning( Angles do not form a monotonic Sequence，' ) ; 
end 


% Round theta to 1 degree increments ， 
theta = round(thetay) ; 


5%5 Keep theta and rho together， 
tr = [theta，rho]; 


% Dejete duplicate angles， The unigque operation 

5%5 alSo Sorts the input in ascending order， 

[w， UV] = uniIqueltr(:，1)); 

tr = trtu)i 和 多 Uidentifies the rows kept by unique， 


二 JIf the last angle equals 360 degrees plLUS the first 
% anglje，delete the 1ast ang1l1e， 
if trf(end，1) == tr(1) + 360 
人 
end 


% Output the angl1e values， 
angle = tr(:，1); 


s The Signature jls the Set of values of rho _ corresponding 
乞 to the angle Values . 
st = tr(:，2); 


function [srad，Sang，S] = Specxture(f) 

%SPECXTURE Computes Spectral texture of an image 

各 [SRAD，SANG，3] = SPECXTURE(F) computes SRAD，the Spectral energy 
% distribution as 8a function of radius from the center of the 

乞 ”Spectrum，SANG，the Spectral energy distribution as a function of 
% angle for 0 to 180 degrees in increments of 1 degree，and S = 

溃 10g(1 + Spectrum of f)，normalized to the range [0，1]，The 

5 maximum value of radius is main(MNN)，where M and N are the number 
当 0f rows and Columns of :image (region) 于 ，Thus，SRAD is a row 

当 Vector of length = (min(M，N)/2) - 1 and SANG is a Fow vector of 
名 1ength 180， 





和 
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% 0btain the centered Spectrum，3S，of ff，The variables of S are 

ss (uU V)，running from 1:M and 1:N，With the Center (Zero frequency) 
sat [M/2 + {1，N/2 + 1] (S$ee Chapter 4)， 

S = fftshift(fft2(f)) | 

S = abs(39); 

[M，N] = Sizel(S)， 

X0 = M/2 + 1; 

N/12 二 二: 

务 Maximum radius that guarantees a circle centered at (xX0，yY0) that 


s do08eS not excCeed the boundaries of 9， 
rmax = min(M，N)72 于 |; 


cc 
< 
上 


和 5 Compute Srad , 
Srad = zeros{( 了 了 ，rmax ) ; 
sradf(1) = S(X0，yYO0) |; 
for mn = 2:rmax 
fxc，yYcj = halfcircle(r，x0，yYO0) 
sradtr) = Sum(S(SUb2ind(Size(S)，XxC，yC))) 
end 


5 Compute Sang， 
[XC，yYcj = hajlfcircle(rmax，X0，yY0) ; 
Sang = zeros(1，1lLengthf(XxCc) ); 
for a= 1:1ength(xc) 
[xfr，yr]l = radlial(x0，Y0，Xxc(a)，yYc(a) ); 
sang(aj = Sum(S(Sub2ind(Size(S) ，Xxr，yr))) 
end 
5 Output the 10g of the Spectrum for easier Viewing，sScaled to the 
名 range {0，1]. 
3S = mat2gray(1og(1 + 3S)); 


function [xc，yYCc] = halfcircle(，Xx0，YO) 
Computes the integer coordinates of a half circle of radius FF and 
Center at (x0,Y0) USing one degree increments . 


60es from 91 to 270 becauSe We want the half circle to be in the 
region defined by top right and top 1eft quadrantSs，in the 
standard image coordinatesS ， 


co ob oo oo do ce 


theta=91:270; 

theta = thetax*pIi/1180; 

[xXC，yYC] = pol2cart(theta，P) 

= Pound(xXC) ”+ X0; 点 Column Vector， 
YC = Found(yCc) ”+ yYO0; 


function [xr，yr]l = radial{(xX0，yY0，X，Y) 
%5 Computes the coordinates of a Straight 1ine Segment extending 
5% fromn (X0，yY0) to {X，Yy) ， 
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右 8Based on function intline.m。，Xxr and YF are 
% Peturned as COLIumn vectors 


[xr，yYr] = intline(x0，X，y0，yY); 


function [v，unv] = statmoments(p， mn) 

53SJATNMONENTS Computes statistical central moments of image histogram. 
[W，UNV] = STATMOMENTS(P，N) computes up to the Nth statistical 
Central moment of a histogram Whose components are in vector 
P，The length of P must equal 256 or 65536， 


The program outputs a vector V with V(1) = mean，V(2) = variance， 
V(3) = 3rd moment，， ，VI(IN) = Nth central moment，The randonm 
variab1e values are normalized to the Pange [0，1]j，So al1 
moments al1So are in this range， 


The program also outputs a vector UNV containing the Same moments 
as V，but Using un-normalized random variable values (e.g.，0 to 
255 if Jength(P) = 2^8)， For examp1e，if length{(P) = 256 and VI(T) 
= 0.5，then UNV(1) would have the value UNV(1) = 127.5 (haJlf of 
the [0 255] Prange ) ， 


oo ceoecco oo de de do oo co oo de co 


Lp = jength(p); 
if (Lp ~= 256) & (LpP -~= 65536) 
error( P must be a 256- or 65536-element vector,')， 
end 
G=Lp 一 1 


5 Make Sure the histogram has unit area，and convert it to a 
乞 COJumn Vector， 

p = p/Sum(p)i p= pl )i 

所 Form a Vector of all the possible values of the 

沁 Pandom variab1le， 


Z= 10:6; 

5 Now normalize the z'S to the range [0，1]， 
二 

和 5 The mean， 

mm = Z”*p; 

s%5 Center random variab1les about the mean， 
Z= 了 2Z 一 由 


先 Compute the central moments ， 
= Zeros(1，Dn) ; 


if nargout > 1 
Compute the uncentralized moments， 
Unv = zeros(1，n); 
unv{(1T)=m,*Gi; 


[ 
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Unv(j) = ((z*G).j)*p; 


end 


function [tj = statxture(f，SCcale) 

5%STATXTURE Computes statistical measures of texture in an image， 

T = STATXURE{(F，SCALE) computes Six measures of texture from an 
image (region) FF. Parameter SCALE is a 6-dim row vector Whose 
elements multiply the 6 corresponding elements of T for scaling 
purposes. 于 SCALE is not provided it defaults to all 1S$， The 
output 了 is 6-by-1 vector with the following elements: 


co oocececoedod de do do 


T(T) = Average gray 1Level 
T(2) = Average Contrast 
T(3) = Measure of Smoothness 
T(4) = Third moment 
T(5) = Measure of uniformity 
T(6) = Entropy 

if nargin == 


Scalel(1:6) = 1; 
el1Se 5 Make Sure 让 'S a row Vector， 

SCale = Scalel(:) '; 
end 

obtain histogram and normalize it， 

= imhist(f); 
= p,.VnumelI(Tf) ; 
Length(p) ; 
% Compute the three moments，We need the unnormalized ones 


光 from function statmoments，These are in Vector mu， 
[v，mu]j = StatmomentS(P，3) 


和 
P 
P 
L 


% Compute the SiXx texture measures: 

各 Average gray Leve1， 

t(1T) = mu(1T); 

各 Standard deviation. 

t(2) = mu(2).^0.5; 

务 Smoothness ， 

名 First normalize the variance to [0 1] by 
s% dividing it by (L-1)^2， 

arn = mu(2)/7( 人 L ~ 1)^2; 

(3) = 1 一 1/1+varn); 

5 Third moment (normalized by (人 L - 1)*2 also)， 
t(4) = mu(3)71(L - 1)^2; 

Uniformity ， 

tt5) = Sum(p. 2) 

% Entropy， 

t(6) = ~Sum(p,*(1092(P + 8eps))); 


加 Scale the valLues. 
t = 七 .*SCale; 


r 十 到 
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function [B，thetal = X2majoraxis(A，B， type ) 

s%5X2MAJORAXIS ALigns coordinate X With the major axis of a region. 
[B2，THETA] = X2MAJORAXIS(A，B，TYPE) all1gns the X-coordinate 
axis With the major axis of a region Or boundary，fTne yY-axiS 1S 
perpendicular to the X-axiSs， The rowSs of 2-by-2 matrix A are the 
coordinates of the two end points of the major axis，jin the form 
A= fxl yt; x2 y2]. On input，B is either a binary image (1I.86.， 
an array of class 1ogical) containing a single region，or 让 革 S 
an np-by-2 Set of points representing 8 (connected) boundary，In 
the latter Case，the first Column of B must represent 
x-coordinates and the second column must represent the 
corresponding y-coordinates， On output， 昌 Contains the Same data 
as the jinput，but aligned with the major aXiS， If the input 1S an 
image，sSo is the output; Similarly the OutPUT Is 8 Seduence of 
coordinates if the jinput is Such a sequence， Parameter THETA 1S 
the jnitial angle between the major axis and the X-axlis，The 
origin of the Xy-axiSs System is at the boftton Left; the X-axiS 1S 
the_ horizontal axis and the y-axis is the Vertical， 


Keep in mind that rotations Can jintroduce round-off errors When 
the data are Converted to integer coordinates，wWhich is 8 
requirement. Thus，postprocesslng (e.g,， with bwmorph) of the 
output may be required to reconnect a boundary ， 


ddeoedecoooeoeon ooce ceoc 吕 吕 中 de 


% Preliminaries， 
计 isS10gical(B) 
type = “Pregion ; 
elSeif Size(fB，2) == 2 
type = “ boundary ; 
[M，N] = Sizel(B); 
ifM<N 
error('B is boundary， It must be 0f Size np-by-2; np > 2，) 
end 
%5 Compute centroid for later USe, C is 8 1-by-2 Vector， 
% Tts 1st component is the mean of the boundary in the X-difrection， 
% The second is the mean in the y-direction， 
round( (min(B(:，1)) + max(B(:，1))72))) 
round((min(B(:，2)) + max(B(:，2))72)) 


CT 
C(21) 
% It is possible for a connected boundary to develLlop Small breaks 

% after_ rotation. To prevent this，the input boundary is filLlLed， 

% processed as a region，and then the boundary is re-extracted，Thls 
% guarantees that the Output Will be a connected boundary， 


mi = Pax(Size(B) ) 

% The follLowing image is Of Size 由-by- 几 to make SUre that there 
% there will be no size truncation after rotation， 
bound2im(Bymy 有 ) 

imfilI(B，holes ) 


Oo CO 
攻 
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elLSe 
error('Input must be a boundary or a binary jimage .,) 
end 


% Major axis in Vector form， 

V(1) 了 A(<， 1 ) 人 A(1， 1) 

V(2) 二 A( 的 2) 一 人 (1 2 
=Vv(:);) 名 Visa col vector 


V 
% Unit vector along X-axisS， 
U= [1 0]， 


% Find angle between major axis and Xx-axis， The ang1e 1S 
% given by acos of the ;inner product of U and Y djivided by 
% the product of their norms，Because the inputs are Image 
% pojints，they are in the first quadrant， 
nv = norm(v) ; 
nu = noprm(U) ) 
theta = acosS(U'*VAnv*nu) ; 
if theta > pi/2 
theta = 一 (theta 一 pz/2) 
end 
theta = thetax*1807p1I 


% Rotate by angle theta and crop the rotated image 蒂 0 original Size . 
B = imrotate({B，theta， "bilinear ， Crop ) 


% Convert angle to degrees. 


% If the input was a boundary，re-extract jt， 
ji strcmp(type， boundary ) 
B = boundaries(B) ; 
B = Bf{+1}; 
% Shift so that centroid of the extracted boundary 1S 
% approx equal to the centroid of the original boundary : 
8(:， 1) =B(: 1 一 minB 1)) +c(Di 
B(，2) = B(:，2) -~ min(B(:，2)) + C(2) 1 


end 
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abs, 112 
Accumulator cells. See Hough 
Transform 
Adaptive learning Systems, 498 
Adaptive median filter, 164 
Adjacency, 408,413,427 
adpmedian, TI63 
Affine, 183 
transform (Visualizing), 186 
transformation matrix，188 
all1, 46 
Alpha-trimmed mean fljter， 160 
Alternating sequential filtering， 


371 
AND, 45,.337 
ans, 48 
any, 46 


appcoef2, 201 
apply1lut, 323 
Arithmetic mean fijlter, 160 
Arithmetic opeTatorTs, 40 
ATray, 12 
arithmetic operators,. 41 
dimensions, 37 
editor, 8 
indexing, 30 
operations Summary, 5J4-520 






Artificial inteiligence, 3 
ASCII code, 294 

@, 97 

Average value, 138 
aXxi1S，78 


bar ,77 

Basic rectangje, 456 

Bayes classlfier, 492 

bayesgauSss, 493 

Between-class Varlance, 405 

bin2dec, 300 

Binary Image, 24, 25 

Bit depth, 194 

blLanks, 499 

BLAS, definition of, 4 

Blind deconvolution. see 
Restoration 

bl1kproc; 321 

BLPF, 129 

BMP 135 

Book Web site, 6 

Border. See Reglon 

Bottomhat transformation. ee 
Meorphology 

bound2eight. 434 

bound2four,434 


bound21m, 433 

boundaries, 434 

Boundary, 426. See dso Reglon 
descriptors, 425 
Segments, 452 

break, 49 

Brightness, 207 

bsubsamp, 433 

Butterworth. See also Filtering 

(frequency domain) 

highpass filter, 130 
lowpass filter, 129 

bwdist, 418 

bwhitmiss, 392 

bw1Label1, 361 

bwmorph, 336 

bwperim, 445 
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C MEX-fije, 305 

Canny edge detector 389 

cart2p01;, 451 

Cartesian product, 333 

Cat, 199, 435 

Catchment basin. ee Morphoilogy 

CCIIT.21 

CDF See Cumaulative distribution 
function 
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CeilI.T]14 

〈( jarray. 48.62, 427 
L-indexing, 292 


Cell1.292 
C .LIdisp.293, 428 
C .JIfun, 428 


Cell1p1ot, 293 
Cellstr, 499 
(jlujar.see Minimum perimeter 
polygon 
complex, 440 
mosalc. 440 
(《 iin codes. ee Freeman chain 
codes 
changeclass,26.72 
Cr,23,24,.61.499 
cc 'Ckerboard, 167 
CLPCShift, 433 
CirculLar,94 
(aring border objects. See 
Morphoiogy 
Cjose-open filtering. See 
Morphology 
SIng-by-reconstruction. gee 
Morphology 
Code optimization, S$. See alyo 
Vectorizing 
je words. see Compression 
Lvjngredundancy ee 
Compression 
2 人 22 
9 
Colon,31,41 
Indexing, 43 
.orfimage Processling)， 
194--241 
basics of processing, 215 
hit depth, 194 
rightness, 207 
-MY color space, 206 
CMYK color space, 206 
ojor balancing, 224 
olor contrast, 222 
color correction, 224 
cojor cube. 195 
Dlor edge detection, 232 


color mappings, 215 
color Pixejls, 194 
color segmentation, 237 
color vector gradient, 232 
colormap, 197, 199 
colormap matrix, 197 
cojors, 195 
component images, 194 
coOnversion between color 
models, 204-215 
function sSummary 514, 920 
histograms, 225 
了 SI color space, 207 
也 3Y color Space, 205 
hue. 204.207 
Image representation in 
MATLAB, 194 
Image sharpening, 230 
Image Smoothing, 227 
Images, 12. 194-215 
iadexed images, 197 
interactive color editing (ICE)， 
218.527-551 
IPT color functions, 199-204 
NTISC color Space, 204 
number of RGB colors, 195 
Primary colors, 195 
ITepresentation in MATLAB， 
194 
RGB coior space, 195 
RGB images, 194 
Saturation, 204, 207 
Secondary colors. 195 
segmentation In RGB vector 
Space, 237 
Sharpening. 231 
smoothing, 227 
Spaces, 204 
transformations, 21S$-227 
Vector Processing, 215 
YCbCr color space, 205 
Colorgrad.234 
Colormap、132 
colorSseg, 238 
Column vector, 14 
Columns, 13 


Command 
history, 7 
History window, 9 
hne, 1 
Prompt, 8 
Window, 7.8, 14 
Comment lines, 40 
Comiments, 39 
compare,285 
Complement 
image, 42, 67 
set, 335 
Component images, 194 
Compression, 282-333 
binatry code, 289 
biock code, 290 
blocking artifact, 323 
code word, 286, 290 
coding redundancy, 282, 286 
compression ratio, 21, 283 
decoder, 283 
diftferential coding, 313 
encoder, 283 
entropy, 287,288 
error free, 285 
first-order estimate, 288 
Hufftman coding and decoding， 
289-309 
Implicit quantization, 327 
Improved gray-scale (TGS ) 
quantization, 316 
Information preserving, 285 
information theory, 287 
instantaneous code, 290 
interplxel redundancy, 282, 309， 
310, 314 
Inverse mapper, 280 
JPEG baseline coding System， 
318 
JEEG comPpression, 317-333 
lossless, 28S,313 
lossy comPression, 285 
mapper, 286 
mapPpings, 310 
Packbits. 21 
prediction error. 314 
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Predictive coding, 310, 313 
predictor, 311 
prevlous pixei coding, 313 
psychovisual redundancy, 282， 
313 
quantization, 315 
quantizer, 286 
redundancies, 282 
Symbol coder, 286 
Symbol decoder, 286 
transtform coding, 318 
transform normalization array， 
319 
uniquely decodable, 290 
variable-length coding, 282， 
295.310 
Computer vision, 3 
Computer, 48 
Concave vertex, 441 
Connected component, 359, 360， 
408, 422, 426. See also 
Morphology 
Connectivity, 408, 455 
minimally connected, 427 
Connectpoly, 435 
Constants. See MATLAB 
Constrained least squares 
fltering, 173 
Continue, 49 
Contour. See Region 
Contraharmonic mean filter， 
160 
Contrast, 480 
enhancement, 373 
stretching transformation, 68 
Control points, 191, 217 
choosing interactively, 193 
Conv, 94 
Conv2, 25S7 
Converting 
between data classes, 25 
between image classes, 26 
between image types, 26 
colors from HSIto RGB.211 
colors from RGB to HIS 209 
to other color spaces, 204 


Conyvex 

deficiency 452 

huil, 452 

Vertex, 441 
Conyolution, 89. 90. 92 115 

fijter. 89 

kernel, 89 

mask, 89 

periodic functions, 116 

theorem, 115 
Conway's Game of Life, 354 
COonway1awSs, 355 
Coordinate convyentions, 13 
Coordinates.、12 
corr,94 
Correlation, 90-95 

between pixels, 282 

mask, 490 

matching, 490 

template, 490 
Covariance matrix, 238. 475. 486 
covmatrix, 476 
Cp2tform、192 
cpSelect, 193 
Cumulative distribution function， 

81,144, 220 

table of 146 
cumsum, 82 
CurTent Directory 7,8.15 
CUrTrent Directory window, 8 
Cygntus Loop, 416 
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Data classes, 23 
coOTiverting, 25 

Data matrix, 197 

DC component, 109 
average value, 109. 138 

DCT, 318 

了 十 Ctmtx, 321 

dec2base, 908 

dec2bin, 298 

jDecision. See also Recognition 
boundary, 488 
function, 488 


淡 Index “599 
Decoder see Compression 
deconvblind, 180 
deconvIucy, 177 
Deconvolution. 142 
blind, 166, 179. 180 
Wiener, 173 
deconvreg, 175 
deconvwnr, 171 
Degradation function, 142 
estimating, 166. 179 
modeling, 166 
Delimiter, 61 
Description, 455-483 
basic rectangle, 456 
boundary, 455 
boundary length, 455 
boundary segments, 452 
diameter, 456 
eccentricity, 456. 465 
end points, 456 
Fourier descriptors, 458 
IPT function regionprops, 463 
major axis, 456 
ImInOIT axis, 4S6 
moment invariants, 470-474 
Principal components， 
474-483 
Shape numbers, 456 
statistical moments, 462 
texture, 404-470 
detcoef2,.261 
DFT 108. See Discrete Fourjer 
transform 
dftcorr,491 
df 坏 flt,， T22 
dftuv, 128 
diag,239 
Diagonal neighbors. 359 
diameter, 456 
Diameter. yee Description 
dff, 373 
DIFFERENCE, 337 
Difference of sets, 336 
Differential coding. .See 
Compression 
Pigital image, yee Image 


) 名 Index 


ttion. See Morphotogy 

-37 

'UM 

sfined, 914 

Inction Summary 914-521 

,284 

:ctorles, 15 

:rete cosSine transtorm, 318 

:Tete Fourier transtorm 
(DFT), 108 

通 辆 0 此 

Yerse, 110, 114 

-lgin of 111 

aidding, 91,93, 116 

)ectrum, 70, 109 

TOmetry TI1LU 

TO padding, 116 

Timinant function. .yee 
Recosgnltion 

2, 59 

)jay Pane, 9 

)lavyling Images, 10 

amnCce 

mputing, 483 

uclidean, 17,237-239,450， 
485 ,489 

ahalanobis, 237-239,480 

ansform,418 

1er、 199 

notation, 40 

)16,24 

21 

Tee Wavelets 

node, 2932 

amijc Tange,，L6,.69, 114 

WwW, 16, 18 

anipujating, 69,81,474 

minal, 3206 


jacent, 359 

nnected path. 360 
nnectivity, 430 
ighbors,，339 

ntTicity ee Description 


Edge (detection), 384-393 
Canny edge detector, 389 
color, 232 


derivative appProxlimations, 387 


direction. 384 
double. 385 
gradient, 384 
IPT function edge, 384 
Laplacian of Gaussian (LoG)， 
388 
location, 385 
magnitude, 384 
masks, 387 
Prewitt detector, 100, 387 
Roberts detector, 388 
second-order derivative, 385 
Sobel detector, 126, 385 
ZeIO CIOSSings, 388 
edgetaper, 172 
edit, 40 
elig, 478 
Eigen axes, 450 
Eigenvalues, 475-476,480 
Eigenvector, 473-476 
axis alignment, 483,512 
Principal components, 475 
EISPACK .definition of, 4 
Encoder. ee Compression 
end, 31 
endpointSs, 33 二 
Endpoints, 350, 358, 450. ee 4150 
Description 
Enhancement, 69. 108, 141, 222， 
36.373 9137 
Entropy, 287,288.314.4606. ee 
ao Compresslon 
eps, 48.69.70 
Erlang, See Probabijity denslty 
function 
Erosion. See Morphojlogy 
error、 50 
Euclidean distance. ee 
Distance 
eval.501 
Exponential, Sec Probability 
density functilon 


下 xport, 23 
Extended 
minima transform, 424 
(padded) function, 116. see 
also Function 
External markers, 422. ee 
Segmentation 
eye, 494 


F 


4-adjacent, 359 
4-connected path, 360 
4-cormectivity, 436 
4-neighbors, 339 
Faceted Shading, 133 
false, 38 
false, 410 
fchcode, 437,446 
feval. 4135 
FFT, 112. 9ee Discrete FEourlerT 
transform 
fft2, T12 
fftshift, 112 
fieldnames,284 
Fields, 21,63, 184, 430 
Fisgure callback functions, 54> 
Figure window, 7 
figure, 18 
Filling hojes, 365. 9ee 4l50 
Morphology 
Filter(ing) (frequency domaln) 
11 和 -139 
basic steps, 121 
Butterworth,lowpPass, 129 
convolution theorem, 1135 
convolution, 115 
extended functions, 110 
filter transfer function, 115 


finite-impulse-response (下 IR)， 


122 
from spatial fiiter, 122 
Gaussian highpass, 136 
Gaussian lowpass. 129 
generating directly, 127 
high fredquency emphasis. 138 
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highpass, 127, 130 

ideal lowPpass, 129 

lowpass, 116,. 127, 129 

Imeshgrid arTrays、 128 

M-function for filtering, 122 

obtaining from Spatial fllters， 
122 

padded functions, 116 

Periodic noise filtering, 160 

pPlotting, 132 

sharpening, 136-139 

smoothing (lowpass filtering)， 
129-132 

transfer function, 11> 

wiaparound error, 116 

Zero-phase-shift, 122 

Filter(ing) (spatial), 03,89-10 /， 

158-166 

adaptive median filteT， 104 

alpha-trimmed mean filter， 
160 

arithmetic mean filter, 160 

averIaging filter, 100 

contraharmonic mean filter， 
160 

convolution, 89-94 

correlation, 90-94 

Gaussian filter, 100 

generating filters with function 
fspecial, 99 

geometric mean filter, 100 

harmonic mean filter, 160 

kernel, 89 

Laplacian filter, 100 

Laplacian of Gaussian (LOG) 
filter, 100 

linear, 89, 99 

mask, 89 

max filter, 103, 160 

mechanics of linear Spatlal 
filtering, 89 

median adaptive filter, 164 

median filter, 105, 100 

midpoint filter, 160 

min filter, 103, 160 

motion fiiter, 100 


nonlinear, 89, 90, 104 
order-statistic flter, 104 
Prewitt filter, 100 
rank filter, 104 
response of filter, 89,379 
Sobel fijter, 100 
Supported by IPT 99 
template, 89 
unsharp filter, 100 
using IPT function imfIItenf， 
99 
window, 89 
Fiitering, 65, 89, 122 
frequency domain. 9ee FilterT 
(frequency domain) 
restoration. ee Restoration 
spatial. See Filter (Spatial) 
find, 147,432 
First difference, 430, 456 
Flat structufing elements, 343， 
367-370 
fliplLr, 472 
flipud, 472 
floor, 114 
Flow control, 49 
Floyd-Steinaberg algorithm, 199 
Eolders, 19 
for, 49 
Forming patterna vectoIS, 488 
Formulation, 407 
Fourier 
coefficients, 109 
descriptors, 458 
spectrum, 70, 109. 9ee Discrete 
Fourier transform 


transform.See Discrete Fourier 


transform 
frdescp, 4959 
Freeman chain codes, 436 
Frequency 
domain, 108 
filter. See Filter (frequency 
domaln ) 
rectangle, 109 
response of FIR filter, 122 
freqz2, 123 
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fspecial,99,120,. 167 


filter list、100 
fu1L1,94,.396 


Eunction. see also M-functlon 


body, 39 
categories, 514-521 
definition line, 39 
extended (padded), 91,93, 116 
handle, 97 
IPT summary, 514-3521 
MATLAB summary, 9521-526 
optimization, 253 
padding, 91,93， 110 
zero padding, 116 
Fuzzy iogic, 4 
FWT, See Wavelets 


C 


Gamma, 67,72, 148. ee do 
Eriang 
Gaussian, 38, 86, 100, 129, 136， 
148 
exXpressions for, 146, 388 
highpass filter, 136-138 
Laplacian of 388 
lowpass filter, 129 
multivariate PDF, 493 
noise, 143, 148 
spatial filter, 100 
gca, 78 
gcf, 9540 
Generator equation, 145 
Geometric mean filter, 160 
Geometric transformatlons， 
182--193 
affine transtorm (visualizing)， 
186 
affine transformation matTrlx， 
188 
forward mapping, 187 
inverse mapping, 187 
registration, 191 
get, 218 
getfie1d,540 
getsequence, 342 


1 /时 Index 


可 下 15 
bal thresholding, 404.405, See 
also Segmentation 
10bal, 292 
I PEF, 129 
:dient, 232. 384 
-2fnlition, 232.384 
detectors. ee Edge detection ， 
Fiiter (Spatial) 
cojlor vector Space, 233 
In watershed segmentation, 420 
morphological, 369 
j nulometry. ee Morphology 
) phlical user interface (GUD)， 
59,.527-351 
Jraphics formats, 15 
TDhs, 498 
1 -.y ievel See Intensity 
ray2ind, 200,201 
1 ysScale.ee ao Intensity 
lages.ee Image. 
ee Morphology 
raySlice,.200,201 
1 /thresh, 406 
丰 
Scale, 76 
以 /LEL, 39, 927--S51 
L mainfcn, 533 
L jata, 3S39 


| ine, 10. 39 
landle, 218 
“monlc mean filter, 160 
1 39 
[ef 9 
browser, 9 
vlgator pane, 9 
taining, 9 
page. 10 
text, 40 
xt bjock, 10 
_ifrequency emphasis 
ftering,，138 
er-jevel processes, 3 


Highpass filtering, 136 

hist, 150 

histc, 299 

histeq, 82 

Histogram, 76 
equallization, 81 
Imapplngs, 225 
matching, 84 
Processing, 76 
Specification, 84 

histroi, 156 

Hit-or-miss transformation, 3S0 

h-minima transform, 374 

hold on, 81 

Hotelling transform, 475 

Hough transform, 393 
accumulator cells, 395 
line detection and linking, 401 
Peak detection, 399 

hough., 396 

houghlines, 401 

houghpeaks, 399 

houghpixelIs, 401 

hpfilter， 136 

黄 SI color space, 207 

hsi2rgb,213 

HSY color Space, 205 

hsv2rgb, 206 

HTMIL, 9 

有 iue, 204,.207 

huff2mat, 301 

Huffman, 290. See also 

Compression 

codes, 289 
decoding, 301 
encoding, 295 

Hurricane Andrew, 491 


i, 48 

ice.218 

Jnteractive color editing (ICE)， 
218, 9327-9551] 

ice_openingFcn 

Ice_outputFcn 


Jdeal lowpass filter, 129 
Identity matrix, 494 
if, 49 
ifft2. 114 
Ifftshift TI14 
计 rdescp, 439 
JIGS. ee Compression 
下 PEF 129 
im2bw, 26. 406 
im2col, 321 
im2doub1e, 26 
im2jpeg, 319 
im2jpeg2k, 327 
im2uint16, 26 
im2Uint8, 26 
imabsdiff, 42 
Imadd, 42 
imadjust, 66 
imag, 114 
Image, 2. 12, 335 
anajysls, 3, 334 
aritthmetic operators, 42 
asamatrix, 14 
background, 426 
binary, 24, 25 
blur, 166, 167 
color. 9ee Color image 
PITOCeSSing 
coOmPpresslon. ee Compression 
converting, 26 
coordinates, 12 
debiuring, See Restoration 
description, ee Description 
digital, 2, 13 
display options, $14 
displaying, 16 
element, 14 
enhancement, 65 
formats, 1 
Tay-Scale, 60, 94, 199,200, 204 
indexed,24, 197 
intensity, See [Intensity 
Imonochrome, 12,24. 66,.222.408 
morphology. See Morphology 
multispectral,479.492, 496 
nolse. ee Noise 





origin, defined, 13 
PITOCessing, 2. 3 


recognition. ee Recognition 


reglstration, 191 
reglstration, 191-193 


Tepresentatlon, 12. see also 


Representation 


restoTation. ee 人 Restoration 


Segmentation. ee 
Segmentation 


transformas. See Transforms 


tyPes, 24 


Image Processing Joolbox. ee 


ao MATLAB 
background, 1,4, 12 


comPlementary toolboxes, 4 


coordinate convention,， 13 


function Summary, $14--S21 


image representation, 13 
imapprox, 198 
imbothat, 373 
imclearborder,366 
imcl10se, 348 
imcomp1ement, 42, 07 
imdilate, 340 
imdivide, 42 
imerode, 347 
imextendedmin,424 
imnfill, 366, 432 
Imfilter,92 

options for, 94 
mfzinfo, 18 
imhist, 77 
imhmin, 374 
Imizmposemin, 424 
iml1incomb, 42,. 199 
nm 前 UItiD1LY, 42 
imnoise， 106. 143 

options for. 143 
imnoise2, 148 
imnoise3, 192 
imnopen, 348 
JImproved gray-scale (IGS) 

quantization, 316 
Impulse, 92, 122, 151. 166 
imratio,283 


imread, [4 
imreconstruct, 363,410 
imregionaLmin, 422 
imrotate, 472 
Imshow, 16 
imstack2vectors,476 
imsubtract, 42 
imtophat, 373 
imtransform, 188 
Imwrite, 18 
ind2gray, 200, 201 
ind2rgb, 200,202 
Indexed Images. ee Image 
Jndices, 32, 42, 56. 147 
inpolygon, 446 
input, 60 
Input, interactive. 59 
int16, 24 
int2str, S06 
Int32, 24 
int8, 24 
Intensity, 2, 12. 9ee al5o 
Histogram 
Imaage type, 24 
in HSI color model,207 
in imdexed Images, 197 
in Pseudocolor, 216 
in Set theory view, 335 
PDF, 195 
scalling, 75 
transformations, 09-76 
Interactive I/O, S9 
Interier 
angje, 441 
polnt, 427 
Internal markers, 422 
interp1q,217 
Interplxel redundancy. ee 
Compresslon 
Interpolation, 472 
blcubic, 188, 472 
bnear， 188, 472 
cubic spline, 217 
in faceted shading, 135 
nearest nelghbor, 188, 190, 472 
Intersection. ee Set Operations 
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int1line, 436 

intrans，73 

inv, 41,403 

[Inverse mapping, 83, 187,220 

invmoments,472 

IPTl.sSee Image Processling 
ToolPox 

IISx, 48 

iscel1, 48 

iscellstr,501 

is10ogical, 23 


】 


j , 48 

Joint Photographilc xPerts 
Group, 15 

JPEG2000,282,325, 331 

JPEG Compresslon. ,9ee 
ComPIession 

JPEG, 13, 323 

JPEG,323 

jpeg2im, 322 

jpeg2Kk2im, 330 


K 


人 Kernel, 89,243,.245, 367 


L 


Label matrlx, 361 
Labeling connected components， 
3359 

APACK, definition of 4 

Laplacian, 100, 174,230, 385 
of a Causslan (LoG), 388 
operator, 179 

1ength, 531 

Length 
of array, 51] 
of boundary, 495 
of String, 499 

Line 
detection, 381-384 
detection and linking, 403 
joining two points, 436 


奖 Index 


3 

D filter design. 317 

dspatial invarliance, 142 

mbination, 42. 139 

nformal transformation， 
188 


:quency domain filtering. ee 


Filter (frequency 
domain) 

dexing, 35 

iear PTOceSS, 142 

ction, 100 

atial filtering. ee Filter 
(Spatial) 

stems and convolution, 115 

PACK, definition of 4 

ipace, 32, 185 

这 

adient, 389 

axima of gradient, 386 

aximunm operator, 367 

inimum operator, 368 

resholding, 405,407 

flables, 292 

68 

10. 68 

2，68 

arithm transformation， 
68, 70 

LCal, 23 

ical, 23 

nctions, 45, 46 

)erators, 45 

normal. ee Probability 
density functlon 

< 于 or 10, 40 

kup tables, 353 

P, 51, 32. $ee YectoriZing 

3r,， 02 

-jevel procesSses, 3 

pass filter, 9ee Pijter 
(frequency domain) 

2mat, 3 2 

ilLter, 131 


y-Richardson algorithm, 176 


1inance. 204 


从 


magic, 38 
Mahalanobis distance. see 
Distance 
Major axis, 456. See Descriptlon 
makelut, 353 
maketform, 183 
transformations Supported, 192 
manualhist,87 
Mapping 
color, 216, 223 
forward, 187 
in CompPresslon. 9ee 
Compresslon 
in indexed images, 197 
intensities. 9ee Intensity 
transformatlons 
iaverse, 187 
inverse, 220 
in set theory, 335 
Marker 
in morphological 
reconstruction, 362 
In plots, 79 
in region segmentation, 414 
in watershed segmentation, 422 
Mask, 89,362,379. See ao Filter 
(Spatial) 
mat21PCc, 312 
mat2gray, 20,43 
mat2huff， 298 
mat2Sstr, 307 
Matching. See also Recognitton 
coOFrTelation, 490 
regular expIeSS10n, 502 
String, S08 
MAT-fijes, 11 
MathWorks web site, 5 
MATLAB 
arTay indexing, 30 
background, 4 
command line, 1 
constants, 48 
definition of 4 
desktop, 7 


editor, 9 
environrment, 7 
function sSummary 321-526 
fundamentals, 12-04 
helip, 9 
M-function. See M-function 
number representation, 49 
operators. 9ee OPeratorS 
plotting, See Plotting 
Predefined colormaps, 200 
programming, 38-04 
prIompt, 135 
retrieving work sesslon, 10 
saving WoOrK session, 10 
string manipulation, 499 
variables, 48 
Matrix, See diso ArfTay 
arithmetic Operations, 40 
dimensions, 37 
indexing, 30, 32 
max, 42 
Maximum-likelihood estimation 
(MLE), 180 
Mean, 147, 153, 198, 460 
Mean vector, 473 
mean, 362 
mean2， 73 
medfilLt2, 1006 
Medial axis, 453 
Median filter, 105, 160 
adaptive, 164 
median,， 105 
人 esh, 132 
meshgrid, 59,183 
Metacharacters, S01, 302 
mexErrMsgTxt,307 
MEX-file, 305 
M-file, 4 
和 filename, 533 
M-file. See M-function 
M-function, 4 
components, 10 
editing, 9 
help. 10 
listings, 332--593 
programming, 38-04 


ee 


人 





Mid-levejl processes, 3 
Midpoint filter, 160 
mIn, 42 
Minima imposition, 424 
Minimally connected, 427 
Minimum-perimeter polygons， 
439 一 49 
cellular complex, 440 
cellular mosalc, 440 
Minor axlis, 456 
minperpoly, 447 
MLE, 180 
Moment 
about the mean, 195, 404 
central，1335 
invarianfts, 463, 470-474 
order, 153,470 
Monochrome image. ee Image 
Monospace font, 14 
Morphology, Morphological， 
334-377 
adjacency, 359 
alternating Sequential 所 ljtering， 
| 
bottomhat transformation, 373 
clearing border objects, 366 
close-open 和 iltering, 371 
closing gray-scale images, 303 
closing, 348 
closing-by-reconstruction, 375 
connected component, 359--362 
Conway's Game of Life, 354 
dilation, 338 
eTOSion, 342 
filling holes, 363 
和 at structuring elements, 343， 
367-370 
function bwmorph, 356 
function strel, 341 
gradient, 369 
granulometry 373 
gray-scale dilation, 366 
gray-scale erosion, 368 
gray-scale morphological 
freconstruction, 374 


gray-Scale morphology， 
366-377 
hit-ormiss transtormation, 350 
h-minima transform, 374 
labeling connected 
components, 359 
look-up tables, 353 
marKker, 362 
Imask, 362 
open-close filtering, 371 
opening, 347 
opening by reconstruction, 363 
opening gray-scale images, 369 
operations, tabie of 357 
parasitic components, 358 
Pruning, 338 
Teconstruction, 302 
reconstruction, gray-Scale 
images, 374 
reflection of structuring 
element, 330 
skeletonizing, 336, 358 
strel object, 342 
structuring element, 334, 338 
structuring eiement 
decomposition, 341 
thinning, 396 
tophat-by-reconstruction, 376 
watershed. See Segmentation 
Metion, 100 
Mujtiresolution, 244 
mxArray, 307 
mxCal1loc, 307 
mxCreate, 307 
mxGet, 307 


NaN, 48 

nan, 48 
nargchk, 71 
nargin, 7I 
nargout, 71| 
ndims,37 
Negative image, 07 
Neighbor, 359 
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Neighborhood, 65, 104 
gradient, 232 
implementation, 90 
in color images, 216 
Processing, 89 

Neural network , 4, 498 

nextpow2,， 117 

n1Lfilter,， 96 

Noise 
2-D sinusolid, 150 
Erlang, 143, 146 
estimating parameters, 153 
exponettial, 148 
filtering. ee Filtering 
frequency filters. ee Ellter 

(frequency domain ) 
GOaussian, 143, 148 
generating, 143, 148, 122 
lognormaal, 148 
models, 143 
periodic, 150, 152, 160 
Poisson, 143 
Rayleigh, 148 
salt 信 pepper, 107, 143, 148 
spatial filters. %ee FijteT 

(SPatial) 
speckle, 143 
uniform、 1i48 

Norm (上 ucildean), 485 

norm, 485 

Normalized histogram, 76 

NOT, 45,337 

NTSC color space. ee Color 

ntsc2rgb, 205 

Number rePresentation, 49 

numel, 51 


0 


Object, 359. See ao Connected 
component 
callback functions, 349 
recognition, 484 
ones、J8 
Opening and closing. yee 
Morphology 


”QUEX 


ening by reconstruction. ee 
Morphology 


tor. ee also Filter (spatial) 


irithmetic, 40 
~catenate, 193 
ivative, 101 
:Abected value, 170 
dentity, 142 
)iacian, 174 
Cal 45 
relatlonal, 44 


tical transfer function (OTE). 


ee Restoration 
”1um statistlical classlfieTs， 
492 
337 
rstatlstlc filters. 9%ee Filter 
(spatial) 
dfilt2, 105 
0 
1 itige, 13, 06, 90, 93 
Fonuzrier transform, 111, 112 
structuring element, 336， 
338-339, 343 
)gonajlity, 243 
of eigenvectors, 4735 
JTF See Restoration 
”3sf, 142 
LSs method, 405 
erSegmentation, 420, 422 


darray, 97 

8dsize. 117 

1 ing.9ee Function 

iTameter Space.ee Hough 
transform 

Littic components, 398 

1 h, 196 

tth 

hetween Pixels, 360 

ATLAEB, 8 

.in.See Recognltion 

DF ee Probabilty denslty 
function 


Pel. 14 
Pepper nolise, 162 
Periodic 
noise. ee Noise 
Sequence, 119 
Periodicity 110 
permute, 486 
persistent, 353 
Phase angle, 109 
pi, 48 
Picture element, 14 
Pixel 14 
pixeldup, 168 
Pixval, 17 
pPLot, 37, 80 
Plotting 
1-D functions, 37, 76-81 
2-D functions, 132--133 
PNG, 15 
Point detection, 379 
Pont Spread function (PSF).9ee 
Restoration 
Polsson. 9ee Noise 
po1l2cart, 451 
polyang1les,S10 
Polygonal apPIoximation, 439 
PNG, 15 
pow2, 300 
Power Spectrum, 109, 170 
Predicate, 408 
Prewitt edge detector. 9ee 下 dge 
Primary color. ee Color 
Principal components transform， 
473 
princomp,477 
print,23 
Probability density function 
(PDF), 81, 144 
Erlang, 146 
estimating parameters, 133 
exponential, 146 
Gaussian, 38, 86. 143, 146, 493 
generating random numbers， 
144, 148 
hlistogram equalization, 81 
in coOIor transformations, 220 


lognormal. 146 

multivarlate, 493 

Rayleigh, 146 

Salt 人 Pepper, 146 

tabje of 146 

unliformn, 146 
Prod, 98 


了 Prompt, 1 


Pruning. ee Morphology 

Pseudocoior.see Coior 

psf2otf, 142 

Psychovisual redundancy. ee 
CompPpresslon 


Q 


qtdecomp, 413 
qtgetbLk, 413 
Quadimages, 412 
CQuadregions, 412 
Quadtree, 412 

Qnuajlity,in JPG Images, 19 
Quantization, 13. See at5o 

Compresslon 


R 


rand, 38, 144, 145 

randn, 38,144, 147 

Random number, 145 

randvertex, 910 

Rayleigh. ee Probabillity density 

function 

Reading Images, 14 

real, 1135 

Fealmax, 48 

realmin, 48 

Recognition, 484--313 
adaptive learning, 498 
Bayes classifier, 492 
Correlation, 490 
decision boundary, 488 
decislon function, 488 
decision-theoretic methods， 

488 

discriminant functlon, 488 
distance measures, 483 





机 
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Ci 


matching regular eXpreSsions， 
302 
matching, 489-492, 902， 
minlimum-distance classifier， 
489 
neural networks, 498 
Pattern class, 484 
Pattern vectors, 488 
Pattern, 484 
Tegular expPressjons, 501 
String matching, 908 
string representation, 499 
string Similarity, 908 
Structural methods, 498 
Structural, 498-512 
tralining, 488 
References, organlzation of 工 
Reflection of structuring element， 
336 
regexp, 902 
regexpi, 903 
regexprep, 3S03 
Reglon, 3, 426. See also 
Description 
border, 420 
boundary, 426 
ciassifying in multispectraj 
Imagery, 495 
contour, 426 
of interest (ROD, 156 
Region-based segmentation, 407 
regiongrow, 409 
Region growing, 408-41 
adjacency, 408,413 
regionprops, 463 
Region splitting and mergling， 
412-417 
adjacency, 408,413 
Registered Images, 476 
Regular exXpressions, 901. 9ee 
Recognition 
Regujiarized filtering. ee 
Restoration 
Relational operators. See 
OPerators 
rem, 236 


rep1licate, 94 
repmat, 204 
Representation, 436-433 
boundary segments, 452 
cOnvVex deficlency, 492 
convex hul ,432 
Freeman chaln codes, 436 
minimum perlmeter poiygons， 
439 
poiygonal, 439 
Silgnatures, 449 
skeletons, 453 
reshape, 300 
Resolution, 21 
Response, of filter, 89, 379 
Restoration, 141-193 
blind deconvolution, 179 
constrained least Squares 
fijtering, 173 
deconvolution, 142, 179 
degradation modeling, 142， 
106-109 
geometric transformations， 
182-190 
image registration, 191--193 
inverse filter, 169 
Inverse fiitering, 169 
iterative techniques, 176-182 
Lucy-Richardson algorithm， 
176 
nolse generation and modeling， 
143-158 
nolse reductiort 1$8-166 
nonjlinear, iterative, 176 
optical transfer function 
(OTF), 142 
point Spread function (PSE)， 
142 
regularlized fijlterimg, 173 
Richardson-Lucy algorithm， 
176 
using spatial filters, 158 
Wiener deconvolution, 173 
Wiener filter, parametric, 171 
Wiener filtering, 170 
Retrieving a worKk Session, 10 
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Peturn, 49 

RGB. see Color 

RGB color space. See Cojor 

rgb2gray, 20.202 

rgb2hsi, 212 

rgb2hsv, 206 

rgb2ind, 200, 201 

rgb2ntsc, 204 

rgb2ycbcr, 205 

rgbcube, 195 

Richardson-Lucy algorithm 176 

Rms (root-mean-square) error， 
285 

Roberts edge detector. See Edsge 

ROI. ee Region 

roipoly, 1S6 

Pot90, 94 

round, 22 

Row vector, 14 

Rows, 13 

Rubber-sheet transformation 1f82 
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salt 人 pepper noise, See Noise 
salt noise, 162 
Sanme, 94 
Sampling, 13 
Saturation. See Color 
Save, 301 
Saving a work session, 10 
Scalar, 14 
Scaling, 18 
Screen capttare, 17 
Search path, 9 
Secondary color. See Cojior 
Second-order derivatives, 385 
Seed points, 408 
Segmentation, 378_425 

color, 237 

double edges, 385 

edge detection, 384-393. See 

also Edge 
edge location, 385 
extended minima transform 
424 


ee 
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xzmentation fcOAL ) 
:Xternal mmarkers, 422 
?obaj thresholding, 404-407 
Tough transform, 393-404. ee 
also Hough transform 
nRGB Vector Space, 237 
ine detection, 381 
ocal thresholding, 407 
)oint detection, 379 
人 EYE 408 
2gion 、 ngand mergling, 412 
sgion-basedu, 417 
ratersShed segmentatlon、 
417--425 
ratershed transformy, 417 
-intersecting polygon. 441 
1icoion, 19 
operations, 335, 337 
?mplement, 335 
ifference, 336 
itersection, 335 
nion, 333 
path, 9 
,78 
field, 3406 
ding interp, 135 
pe numbers, 450 
rpening frequency domain 
filters, 136 
rpening, 101 
了 g, 92 
1, 320 
al PrOCessing, 4 
iature, 450 
atuTes, 449 
darity, 237, 408, 489, 508 
ple polygon, 440 
le subscript, 33 
]1e、24 
leton dimension, 37 
3, 13.15 
etons. 4S3. 9ee 
Representation 
e, 09 
othness. 466 


Sobel, See Edge, Filtering (SPatial) 


sort, 293 
Sortrows, 433 
Sparse matrices, 393 
Sparse, 393 
Spatial 
coordinates, 2, 14 
domain， 109 
filter. See Filter (Spatial) 
jinvariance, 142 
Processing, 63, 213 
Speckle noise. See NoiSe 
Spectral measures of texture， 468 
3fectrum features， 408 
Spectrum 103 
specxture. 409 
Speed comparisons, 5 7 
spfilt, 199 
spline, 218 
splLitmerge,414 
sprintf, 32 
Standard arrays, 37 
Standard deviation, 86, 100, 130， 
148,388, 466 
Statistical error function, 170 
Statistical moments, 462 
statmomentSs, 1535 
statxture, 467 
stem, 79 
str2num, 60 
strcmp, 5S04 
strcnmp, 602 
strcmpi, 3106 
strcmpi, 904 
Strel object, 342 
strel,.341 
strfind, 3S05 
Strings, 498 
manipulation functions, 500 
matching, 308 
measure of similarity, 508 
strjust, 5305 
strmatch, 903 
strncmp, 504 
strncmpi, 505 


strread,01 

strrep, 3506 

strsimilarity, 509 

strtok, 306 

Structural recognition. ee 
Recognition 

Structures, 21, 48, 63, 430 

Structuring element. see 
Morphology 

stPvcat, 504 

subplot,249 

Surf, 134 

Surface area. gee Morphology 

Switch, 49 

symmetric, 94 


计 


Tagged Image File Format， 人 
Template, 89 
text, 79 
Texture. See DescTIiption 
Tform structure, 183 
tformfwd, 184 
tforminv, 184 
Thinning. See Morphology 
Thresholding. See Segmentation 
二: S7 
Tie points, 191 
下 本 及 二 
tit1e, 79 
tDc, 57 
Tophat. See Morphoiogy 
Transform coding.5ee 
Compression 
Transforms 
Discrete cosine, 318 
Eourier, See Discrete FourTieT 
Wavelet. See Wavejets 
Transpose Operator, 5ee 
Operators 
Trees, 498 
true,38,410 
try, . ,Catch, 49 
twomodegauss, 80 


vv nr-orvneee 


Ge 
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U 


uizcontrol. 934 
Untt16,.24 
Uint32, 24 
ULnt8, 16 
uUint8.24 
uUiresume, 540 
Uiwait, 540 
Unary 

minus, 41 

Plus, 41 
Uniform 

linear motion, 167 

random numbers. 9ee PDF- 

uniform noise. See Noise . 
Union.9ee Set OpeIations 
Unique, 433 
unravel.c, 305 
uUnravel .mm, 3006 
Umsharp, 100 
Upper, 62 


V 


varargin;, 72 
varargout, 72 
Variable 
number of inputs, 71 
number of outputs, /1 
Variables. See MATLAB 


Variance, 38, 143， 146, 147, 153， 


155,158, 466 
Vector 
complex conjugate， 41 
Euclidean norm, 174 
indexing, 30 
transpose, 41 
Vectorizing, 36,32,55 
speed comparison, 57 
loops, 553 
version, 48 
View, 132 
vistformfwd, 185 














让 


OFTWard and inverse 
transformation 二 
243 。 





9 外 CEWT using 订 a 计 ar 250 
本 EWT Using the Wavelet 


Tooibox， 246 
| FwrTs without the Wavelet 
Toolbox, 252 
in image Processing, 276 
jinveftse fast wavelet transform， 
271 
kernel properties, 244 
kernel, 244 
mother wavelet, 244 
progresSive reconstructlon， 
279 
Toolbox, 4, 242 
Waverec2,.271 
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er Se Restoration 
g， 132. 9ee cl50 





-BPlotting 
Work session， saving and 
retrieving, 10 
WVO kspace， 8 

browser, 7,8， 10.11 
Variables， 10 








Wraparound erToT， 116 
Writing images， 18 


又 Window Dump, 15 
x2majoraxis, 457 
xlLabel1,79 

xlLim, 80 

xor, 46 

Xxtick, 78 

XWD, 15 


Y 


YCbCr color Space. ee 
Color 

ycbcr2rgb,205 

y1Label1, 79 

y1im, 80 

ytick, 78 


Z 


Zero-crossings detectoT， 388 
Zero-padding, 91, 93， 116 
Zero-phase-shitt filters: 22 
zeros, 38 





